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Methods for Obtaining Plant Varieties 

TECHNICAL FIELD 

The present invention relates to nucleotide sequences which encode polypeptides 
involved in the DNA mismatch repair systems of plants, and to the polypeptides encoded 
by those nucleotide sequences. The invention also relates to nucleotide sequences and 
polypeptide sequences for use in altering the DNA mismatch repair system in plants. The 
invention also relates to a process for altering the DNA mismatch repair system of a plant 
cell, to a process for increasing genetic variations in plants and to processes for obtaining 
plants having a desired characteristic. 

BACKGROUND OF THE INVENTION 

Plant breeding essentially relies on and makes use of genetic variation which occurs 
naturally within and between members of a family, a genus, a species or a subspecies. 
Another source of genetic variation is the introduction of genes from other organisms 
which may or may not be related to the host plant. 

Allelic loci or non-allelic genes which constitute or contribute to desired quantitative 
(e.g. growth performance, yield, etc.) or qualitative (e.g. deposition, content and 
composition of seed storage products; pathogen resistance genes: etc.) traits that are 
absent, incomplete or inefficient in a species or subspecies of interest are typically 
introduced by the plant breeder from other species or subspecies, or de novo. This 
introduction is often done by crossing, provided that the species to be crossed are sexually 
compatible. Other means of introducing genomes, individual chromosomes or genes into 
plant cells or plants are well known in the an. They include cell fusion, chemically aided 
transfection (Schocher et al., 1986, Biotechnology 4: 1093) and ballistic (McCabe et al. f 
1988, Biotechnology 6: 923), microinjection (Neuhaus et al., 1987, TAG 75: 30), 
electroporation of protoplasts (Chupeau et al., 1989, Biotechnology 7: 53) or microbial 
transformation methods such as Agrobacterium mediated transformation (Horsch et al., 
1985, Science 227: 1229; Hiei et al., 1996, Biotechnology 14: 745). 

However, when a foreign genome, chromosome or gene is introduced into a plant, it 
will often segregate in subsequent generations from the genome of the recipient plant or 
plant cell during mitotic and meiotic cell divisions and, in consequence, become lost from 
the host plant or plant cell into which it had been introduced. Occasionally, however, the 
introduced genome, chromosome or gene physically combines entirely or in part with the 
genome, chromosome or gene of the host plant or plant cell in a process which is called 
recombination. 

Recombination involves the exchange of covalent linkages between DNA molecules 
in regions of identical or similar sequence. It is referred to here as homologous 
recombination if donor and recipient DNA are identical or nearly identical (at least 99% 
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base sequence identity), and as homeologous recombination if donor and recipient DNA 
are not identical but are similar (less than 99% base sequence identity). 

The ability of two genomes, chromosomes or genes to recombine is known to 
depend largely on the evolutionary relation between them and thus on the degree of 
sequence similarity between the two DNA molecules. Whereas homologous 
recombination is frequently observed during mitosis and meiosis, homeologous 
recombination is rarely or never seen. 

From a breeder's perspective, the limits within which homologous recombination 
occurs , therefore, define a genetic barrier between species, varieties or lines, in contrast 
to homeologous recombination which can break this barrier. Homeologous recombination 
is thus of great importance for plant breeding. Accordingly there is a need for a process 
for enhancing the frequency of homeologous recombination in plants. In particular, there 
is a need for a process of increasing homeologous recombination to significantly shorten 
the length of breeding programs by reducing the number of crosses required to obtain an 
otherwise rare recombination event. 

At least in Escherichia coli. homologous and homeologous recombination are known 
to share a common pathway that requires among others the proteins RecA, RecB, RecC, 
RecD and makes use of the SOS induced RuvA and RuvB, respectively. It has been 
suggested that mating induced recombination follows the Double-Strand Break Repair 
model (Szostak et al., 1983, Cell 33, 25-35), which is widely used to describe genetic 
recombination in eukaryotes. Following the alignment of homologous or homeologous 
DNA double helices the RecA protein mediates an exchange of a single DNA strand from 
the donor helix to the aligned recipient DNA helix. The incoming strand screens the 
recipient helix for sequence complementarity, seeking to form a heteroduplex by hydrogen 
bonding the complementary strand. The displaced homologous or homeologous strand of 
the recipient helix is guided into the donor helix where it base pairs with its counterpart 
strand to form a second heteroduplex. The resulting branch point then migrates along the 
aligned chromosomes thereby elongating and thus stabilising the initial heteroduplexes. 
Single stranded gaps (if present) are closed by DNA synthesis. The strand cross overs 
(Holliday junction) are eventually resolved enzymatically to yield the recombination 
products. 

Although in wild type' E. coli homologous and homeologous recombination are thus 
mechanistically similar if not identical, homologous recombination in conjugational 
crosses £. coli x £. coli occurs five orders of magnitude more frequently than 
homeologous recombination in conjugational crosses £. coli x 5. typhimurium (Matic et 
al. 1995; Cell 80, 507-515). The imbalance in favour of homologous recombination was 
shown to be caused largely by the bacterial MisMatch Repair (MMR) system since its 
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inactivation increased the frequency of homeologous recombination in E. coli up to 1000 
fold (Rayssiguier et aL 1989, Nature 342. 396-401). 

In E. coli. the MMR system (reviewed by Modrich 1991, Annual Rev Genetics 25, 
229-253) is composed of only three proteins known as MutS, MutL and MutH. MutS 
5 recognizes and binds to base pair mismatches. MutL then forms a stable complex with 
mismatch bound MutS. This protein complex now activates the MutH intrinsic single 
stranded endonuclease which nicks the strand containing the misplaced base and thereby 
prepares the template for DNA repair enzymes. 

During recombination, MMR components inhibit homeologous recombination. In 
io vitro experiments demonstrated tliat MutS in complex with MutL binds to mismatches at 
the recombination branch point and physically blocks RecA mediated strand exchange and 
heteroduplex formation (Worth et al., 1994; PNAS 91, 3238-3241). Interestingly, the 
SOS dependent RuvAB mediated branch migration is insensitive to MutS/MutL, 
explaining the observed slight increase in SOS dependent homeologous recombination, 
is Homeologous mating even induces the SOS response, thereby taking advantage of RuvAB 
induction (Matic et aL 1995, Cell 80. 507-515). 

The MMR system thus appears to be a genetic guardian over genome stability in E. 
coli. In this role it essentially determines the extent to which genetic isolation, that is, 
speciation, occurs. The diminished sensitivity of the SOS system to MMR, however, 
20 allows (within limits) for rapid genomic changes at times of stress, providing the means 
for fast adaptation to altered environmental conditions and thus contributing to intraspecies 
genetic variation and species evolution. 

The important role of MMR in preserving genomic integrity has been established 
also in certain eukaryotes. In its efficiency, the human MMR, for example, may even 

25 counteract potential gene therapy tools such as triple-helix forming oligonucleotides 
including RNA-DNA hybrid molecules (Havre et al., 1993, J. Virology 67: 7234-7331; 
Wang et al., 1995, Mol. Cell. Biol. 15: 1759-1768; Kotani et al., 1996, Mol. Gen. 
Genetics 250: 626-634; Cole-Strauss et al., 1996, Science 273: 1387-1389). Such 
oligonucleotides are designed to introduce single base changes into selected DNA target 

30 sequences in order to inactivate for example cancer genes or to restore their normal 
function. The resulting base mismatches however are recognised by the mismatch repair 
system which then directs removal of the mismatched base, thereby reducing the 
efficiency of oligonucleotide induced site-specific mutagenesis. 

To date, two families of related genes, homologous to the bacterial MutS and MutL 
35 genes have been identified or isolated in yeast and mammals (recent reviews by Arnheim 
and Shibata, 1997, Curr. Opinion Genet. Dev. 7, 364-370; Modrich and Lahue, 1996, 
Annual Rev. Biochem. 65, 101-133; Umar and Kunkel, 1996, Eur. J. Biochem. 238, 297- 
307). Biochemical and genetic analysis indicated that eukaryotic MutS homologs (MSH) 
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and MutL homologs (MLH, PMS), respectively, fulfil similar protein functions as their 
bacterial counterparts. Their relative abundance, however, could reflect different 
mismatch specificity and/or specialisation for different tissues or organelles or 
developmental processes such as mitotic versus meiotic recombination. 

5 To date, six different genes homologous to MutS have been isolated in yeast 
(yMSH). and their homologs have been found in mouse (mMSH) and human (hMSH), 
respectively. Encoded proteins yMSH2, yMSH3 and yMSH6 appear to be the main MutS 
homologs involved in MMR during mitosis and meiosis in yeast, where the 
complementary proteins MSH3 and MSH6 alternatively associate with MSH2 to recognise 

10 different mismatch substrates (Masischky et al., 1996, Genes Dev. 10, 407-420). Similar 
protein interactions have been demonstrated for the human homologs hMSH2, hMSH3 
and hMSH6 (Acharya et al., 1996, PNAS 93, 13629-13634). 

MutL homologs (MLH and PMS), recently reviewed by Modrich and Lahue (1996, 
Annual Rev. Biochem. 65. 101-133) have so far been found in yeast (yMLHl and 

is yPMSl), mouse (mPMS2) and human (hMLHl. hPMSl and hPMS2). The hPMS2 is a 
member of a family of at least 7 genes (Horii et al., 1994, Biochem. Biophys. Res. 
Commun. 204, 1257-1264) and its gene product is most closely related to yPMSl. Proila 
et al. (1994, Science 265, 1091-1093) presented evidence for yPMSl and yMLHl to 
physically associate with each other and, together, to interact with the MutS homolog 

20 yMSH2 to form a ternary complex involved in mismatch substrate binding. 

However, while medical interest in mismatch repair has prompted extensive research 
on MMR in bacteria, yeast and mammals, MMR genes have not been isolated from higher 
plants prior to the present invention and no attempts to adjust the plant MMR to plant 
breeding needs have been reported. 

25 SUMMARY OF THE INVENTION 

According to a first embodiment of the invention, there is provided an isolated and 
purified DNA molecule comprising a polynucleotide sequence encoding a polypeptide 
functionally involved in the DNA mismatch repair system of a plant. In one form of this 
embodiment, the invention provides an isolated and purified DNA molecule comprising a 
30 polynucleotide sequence encoding a polypeptide which is homologous to a mismatch 
repair polypeptide of a yeast or of a human. More particularly, the invention provides 
polynucleotide sequences encoding polypeptides which are homologous to the mismatch 
repair polypeptides MSH3 and MSH6 of Saccharomyces cerevisiae. Still more 

particularly, the invention provides the coding sequences of the genes AtMSH3 and 

v 

35 AtMSH6 of Arabidopsis thaliana. as defined hereinbelow, and polynucleotide sequences 
encoding polypeptides which are homologous to polypeptides encoded by AtMSH3 and 
AtMSH6. 
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According to a second embodiment of the invention, there is provided an isolated 
and purified polypeptide functionally involved in the DNA mismatch repair system of a 
plant, for example a polypeptide which is homologous to a mismatch repair polypeptide of 
a yeast or of a human such as a polypeptide encoded by the genes AiMSH3 or AtMSH6 of 
Arabidopsis thaliana, as defined hereinbelow. 

According to a third embodiment of the invention, there is provided an isolated and 
purified DNA molecule comprising a polynucleotide sequence selected from the group 
consisting of (i) a sequence encoding a polynucleotide which is capable of interfering with 
the expression of a plant polynucleotide sequence encoding a polypeptide which is 
homologous to a mismatch repair polypeptide of a yeast or of a human and thereby 
disabling said plant polynucleotide sequence; and (ii) a sequence encoding a polypeptide 
capable of disrupting the DNA mismatch repair system of a plant. 

According to a fourth embodiment of the invention there is provided a chimeric gene 
comprising a DNA sequence selected from the group consisting of (i) a sequence encoding 
a polynucleotide which is capable of interfering with the expression of a plant 
polynucleotide sequence encoding a polypeptide which is homologous to a mismatch 
repair polypeptide of a yeast or of a human and thereby disabling said plant polynucleotide 
sequence, and (ii) a sequence encoding a polypeptide capable of disrupting the DNA 
mismatch repair system of a plant: together with at least one regulation element capable of 
functioning in a plant cell. Examples of such regulation elements include constitutive, 
inducible, tissue type specific and cell type specific promoters such as 35S, NOS, PRla, 
AoPRl and DMC1. Typically, a chimeric gene of the fourth embodiment will also 
include at least one terminator sequence, more typically exactly one terminator sequence. 

In the third and fourth embodiments, said interference, by said polynucleotide 
sequence, with the expression of a plant polynucleotide sequence encoding a polypeptide 
which is homologous to a mismatch repair peptide of a yeast or a human typically occurs 
by hybridisation or by co-suppression. 

According to a fifth embodiment of the invention there is provided a plasmid or 
vector comprising a chimeric gene of the fourth embodiment. A vector of the fifth 
embodiment may be, for example, a viral vector or a bacterial vector. 

According to a sixth embodiment of the invention, there is provided a plant cell 
stably transformed, transfected or electroporated with a plasmid or vector of the fifth 
embodiment. 

According to seventh embodiment of the invention, there is provided a plant 
comprising a cell of the'sixth embodiment. 

According to an eighth embodiment of the invention, there is provided a process for 
at least partially inactivating a DNA mismatch repair system of a plant cell, comprising 
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transforming or transfecting said plant cell with a DNA sequence of the third embodiment 
or a chimeric gene of the fourth embodiment or a plasmid or vector of the fifth 
embodiment, and causing said DNA sequence to express said polynucleotide or said 
polypeptide. 

5 According to a ninth embodiment of the invention, there is provided a process for 
increasing genetic variation in a plant comprising obtaining a hybrid plant from a first 
plant and a second plant, or cells thereof, said first and second plants being genetically 
different; altering the mismatch repair system in said hybrid plant; permitting said hybrid 
plant to self-fertilise and produce offspring plants; and screening said offspring plants for 

io plants in which homeologous recombination has occurred. For example, homeologous 
recombination may be evidenced by new genetic linkage of a desired characteristic trait or 
of a gene which contributes to a desired characteristic trait. 

According to a tenth embodiment of the invention there is provided a process for 
obtaining a plant having a desired characteristic, comprising altering the mismatch repair 
is system in a plant, cell or plurality of cells of a plant which does not have said desired 
characteristic, permitting mutations to persist in said cells to produce mutated plant cells, 
deriving plants from said mutated plant cells, and screening said plants for a plant having 
said desired characteristic. 

In a preferred form of the ninth and tenth embodiments of the invention, the step of 
20 altering the mismatch repair system comprises introducing into said hybrid plant, plant, 
cell or cells a chimeric gene of the fourth embodiment and permitting the chimeric gene to 
express a polynucleotide which is capable of interfering with the expression of a plant 
polynucleotide sequence in a mismatch repair gene of the hybrid plant, plant, cell or cells, 
or a polypeptide capable of disrupting the DNA mismatch repair system of the hybrid 
25 plant or cells. 

In other embodiments, the invention provides (a) an oligonucleotide capable of 
hybridising at 45°C under standard PCR conditions to a DNA molecule of the first 
embodiment; (b) an oligonucleotide capable of hybridising at 45°C under standard PCR 
conditions to the DNA of SEQ ID NO: 18 and (c) an oligonucleotide capable of 
30 hybridising at 45°C under standard PCR conditions to the DNA of SEQ ID NO:30; with 
the proviso that the oligonucleotide of (a), (b) and (c) is other than SEQ ID N0:1 or SEQ 
ID N0:2. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure I provide^ a diagrammatic representation of the primer sequences used to 
35 isolate AtMSH3. 

Figure 2 is a plasmid map of clone 52, showing restriction enzyme cleavage sites in 
the 5' half of the ftill-length cDNA for AtMSH3. 
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Figure 3 is a plasmid map of clone 13, showing restriction enzyme cleavage sites in 
the 3' half of the full-length cDN A for AtMSH3. 

Figure 4 is a sequence listing of the coding sequence of AtMSH3< together with a 
deduced sequence of the encoded polypeptide. 
5 Figure 5 is a protein alignment of yeast (Saccharomyces cerevisiae) and Arabidopsis 

thaliana MSH3 protein . 

Figure 6 provides a diagrammatic representation of the primer sequences used to 

isolate AtMSH6. 

Figure 7 is a plasmid map of clone 43, showing restriction enzyme cleavage sites in 
10 the 5' half of the full-length cDNA for AtMSH6. 

Figure 8 is a plasmid map of clone 62, showing restriction enzyme cleavage sites in 
the 3' half of the full-length cDNA for AtMSH6. 

Figure 9 is a sequence listing of the coding sequence of AtMSH6, together with a 
deduced sequence of the encoded polypeptide, 
is Figure 10 is a protein alignment of yeast (Saccharomyces cerevisiae) and 
Arabidopsis thaliana MSH6 protein. 

Figure 11 is a genomic sequence listing of AtMSH6. 
Figure 12 is a plasmid map of plasmid pPF13. 
Figure 13 is a plasmid map of plasmid pPF14. 
20 Figure 1 4 is a plasmid map of plasmid pC W 1 86. 

Figure 1 5 is a plasmid map of plasmid pC W 1 87. 
Figure 1 6 is a plasmid map of plasmid pPF66. 
Figure 1 7 is a plasmid map of plasmid pPF57. 

Figure 18 is a diagrammatic representation of an antisense gene construction for use 
25 in homeologous meiotic recombination. 

Figure 19 is a plasmid map of plasmid p3243. 

DETAILED DESCRIPTION OF THE INVENTION 

The present invention is based on the inventors' discovery that there exist in higher 
plants genes which are homologous to MMR genes in E. coli, and to MMR genes in 
30 yeasts and humans. 

Thus, the inventors have identified genes, herein designated AtMSH3 and AtMSH6 y 
of the plant Arabidopsis thaliana which encode the proteins AtMSH3 and AtMSH6. 
These plant proteins are homologous to yMSH3 and yMSH6, respectively. The present 
inventors have isolated cDNAs encoding the proteins AtMSH3 and AtMSH6 and have 
35 isolated the complete gene encoding AtMSH6. Given the teaching herein, other genes 
(for example AtMSH2, and genes of other plants) may be obtained which are involved in 
DNA mismatch repair in plants, including other genes which encode polypeptides 
homologous to MMR proteins of yeasts or humans, such as genes which encode 
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polypeptides homologous to yeast MSH2. MLH1 or PMS2. or to human MLH1. PMS1 or 
PMS2. For example, given the teaching herein, genes of members of the Brassicaceae 
family or of other unrelated families, for example the Poaceae. the Solanaceae. the 
Asteraceae, the Malvaceae, the Fabaceae, the Linaceae* the Canabinaceae, the Dauaceae 
5 and the Cucurbitaceae family, and which encode polypeptides homologous to MMR 
proteins of yeasts or humans may be obtained. 

Examples of plants whose genes encoding polypeptides homologous to MMR 
proteins of yeasts or humans may be obtained given the teaching herein include maize, 
wheat, oats, barley, rice, tomato, potato, tobacco, capsicum, sunflower, lettuce, 
10 artichoke, safflower, cotton, okra, beans of many kinds including soybean, peas, melon, 
squash, cucumber, oilseed rape, broccoli, cauliflower, cabbage, flax, hemp, hops and 
carrot. 

Within the meaning of the present invention, a first polypeptide is defined as 
homologous to a second polypeptide if the amino acid sequence of the first polypeptide 
15 exhibits a similarity of at least 50% on the polypeptide level to the amino acid sequence of 
the second polypeptide. 

A procedure which may be followed to obtain genes AtMSH3 and AtMSH6 is 
described in Example L Essentially the same technique may be applied to obtain other 
mismatch repair genes of Arabidopsis thaliana, and essentially the same technique as 

20 exemplified herein may be applied to cDNA obtained by reverse transcription of RNA 
from other plants. Alternatively, given the sequence information disclosed herein, other 
degenerate oligonucleotide primers, especially oligonucleotides of the invention which are 
capable of hybridising at 45°C under standard PCR conditions (such as the conditions 
described in Example 1 using primers UPMU and DOMU) to AtMSH3 and/or AtMSH6 

25 may be designed and obtained for use in isolating sequences of plant mismatch repair 
genes which are homologous to AtMSH3 or AtMSH6, from other plants. Similarly, 
oligonucleotides of the invention which are capable of hybridising at 45°C under standard 
PCR conditions to plant mismatch repair genes of plants other than Arabidopsis thaliana 
also fall within the scope of the present invention and may be utilised to obtain mismatch 

30 repair genes of still other plants. Typically, such oligonucleotides are capable of 
hybridising at 45°C under standard PCR conditions to a DNA molecule which encodes a 
polypeptide which is homologous to a mismatch repair polypeptide of a yeast or a human. 
The temperature at which oligonucleotides of the invention hybridise to AtMSH3 and/or 
AtMSH6, or to plant mismatch repair genes of plants other than Arabidopsis thaliana, or 

35 to DNA molecules which encode polypeptides which are homologous to a mismatch repair 
polypeptide of a yeast or a human may be higher than 45°C, for example at least 50°C, or 
at least 55°C, or at least 60°C or as high as 65°C. 
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The successful gene isolation disclosed herein demonstrates for the first time the 
existence of MMR in higher plants and indicates the presence of other plant MMR genes. 
For example, genes encoding the plant* homologs of MSHL MSH2. MSH4, MSH5, 
PMSL PMS2 and MLH1 may be identified given the teaching herein. Such genes, as 

5 well as those specifically described herein, separately or in combination, are useful in 
manipulating the plant MMR for plant breeding purposes. Thus, for example, the plant 
MMR may be altered by including in a plant cell a polynucleotide sequence as defined 
herein above with reference to the third embodiment of the invention, and causing the 
polynucleotide sequence to express either a polynucleotide which disables a plant MMR 

io gene, or a polypeptide which disrupts the plant's MMR system. 

The DNA molecule of the third embodiment of the invention includes a 
polynucleotide sequence (herein referred to as a MMR altering gene) which may for 
example encode sense, antisense or ribozyme molecules characterised by sufficient base 
sequence similarity or complementarity to the gene to be altered to permit the antisense or 
is ribozyme molecule to hybridise with the plant MMR gene in vivo or to permit the sense 
molecule to participate in co-suppression. Alternatively, the MMR altering gene may 
encode a protein or proteins which interfere with the activity of a plant MMR protein and 
thus disrupt the plant's MMR system. For example, such encoded proteins may be 
antibodies or other proteins capable of interfering with MMR protein function, such as by 
20 complexing with a protein functionally involved in plant MMR thereby disrupting the 
MMR of the plant. An example of such a protein is the MSH3 protein of Arabidopsis 
thaliana described herein or a protein of another plant which is homologous to the MSH3 
protein of A. thaliana. For instance, overexpression of MSH3 in a plant cell causes 
MSH2 present in the cell to be substantially completely complexed, disrupting the 
25 mismatch repair mechanism or mechanisms in the cell which are functionally dependent 
on the presence of a complex of MSH2 with MSH6. Similarly, mismatch repair 
mechanisms which depend on the presence of a complex of MSH2 and MSH3 may be 
disrupted by the overexpression of MSH6. 

A chimeric gene of the fourth embodiment, incorporating a MMR altering gene, 
30 may be prepared by methods which are known in the art. Similarly, the MMR altering 
gene may be introduced into a plant cell, regenerating tissue or whole plant by techniques 
known in the art as being suitable for plant transformation, or by crossing. Known 
transformation techniques include Agrobacterium tumefaciens or A, rhizogenes mediated 
gene transfer, ballistic and chemical methods, and electroporation of protoplasts. 

35 The MMR altering gene or genes are typically expressed from suitable promoters. 
Suitable promoters may direct constitutive expression, such as the 35S or the NOS 
promoter. Usually, however, the promoter will direct either inducible or tissue specific 
(e.g. callus; embryonic tissue; etc.), cell type specific (e.g. protoplasts; meiocytes; etc.) 
or developmental (e.g. embryo) expression of the altering gene or genes, in order for the 
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MMR system to function in tissue types or cell types, or at developmental stages of the 
plant, in which it is not desirable for the MMR system to be altered. Using such 
promoters, therefore, the activity of a MMR altering gene may be limited to a specific 
staue during plant development or it may be altered by controlling conditions external to 

5 the plant, and the deleterious effects of a permanently disabled or altered DNA mismatch 
repair system in a plant may be avoided. Examples of suitable promoters which are not 
constitutive are known in the art and include inducible promoters such as PKla (reviewed 
by Gatz, 1997, Annual Rev. Plant Phys. Plant Mol. Biol. 48: 89), tissue specific 
promoters such as AoPRl (Sabahattin et al., 1993, Biotechnology 11: 218), and cell-type 

io specific promoters such as DMCl. 

A chimeric gene in accordance with the invention may further be physically linked 
to one or more selection markers such as genes which confer phenotypic traits such as 
herbicide resistance, antibiotic resistance or disease resistance, or which confer some 
other recognisable trait such as male sterility, male fertility, grain size, colour, growth 
is rate, flowering time, ripening time. etc. 

The process of the tenth embodiment of the invention provides, for example, a 
process for generating intraspecies genetic variation by altering the mismatch repair 
system in a plant cell, in regenerating plant tissue or in a whole plant. The plant cell, 
regenerating tissue or whole plant includes and expresses one or more MMR altering 

20 genes which are capable of altering mismatch repair in the plant cell, regenerating tissue 
or whole plant. Alteration of MMR may be achieved, for example, by inactivating the 
genes encoding plant MSH3 and/or plant MSH6. It is preferred to inactivate the plant 
MSH3 and MSH6 encoding genes at the same time and in the same plant cell, 
regenerating tissue or whole plant. Typically in this preferred form of the invention 

25 inactivation of either plant MSH3 or MSH6 alone is insufficient to substantially alter the 
plant's mismatch repair system and only when both MSH3 and MSH6 are inactivated 
simultaneously is the plant's mismatch repair system sufficiently altered to prevent the 
MMR system from recognising base pair mismatches, base insertions or deletions as a 
result of DNA replication errors, DNA damage, or oligonucleotide induced site-specific 

30 mutagenesis. However, in some applications of the invention, inactivation of only one 
gene may also be used to cause genomic instability or increase the efficiency of site- 
specific mutagenesis. , 

If desired, the MMR altering gene or genes may later be rendered non-functional or 
ineffective, or may be removed from the genome of the plant cell, regenerating tissue or 
35 whole plant in order to, restore mismatch repair in the plant cell, regenerating tissue or 
whole plant. The MMR altering gene or genes may be inactivated by means of known 
gene inactivation tools, such as ribozymes, or may be removed from the genome using 
gene elimination systems known in the an, such as CRE/LOX. It is preferred to render 
two eenes. whose gene products combine to incapacitate MMR, ineffective by separating 
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the altering genes through segregation. Therefore, in a preferred embodiment of the 
invention a first plant cell or plant is generated in which only plant MSH3 is incapacitated, 
and a second plant cell or plant is generated in which only plant MSH6 is incapacitated. 
The combination of both genomes, lor example by crossing, then produces significant 
5 MMR deficiency in those cells or plants which have inherited both altering genes. If the 
altering genes are expressed from unlinked loci, gene segregation restores MMR activity 
in the progeny of the cells or plants. 

In a process of the ninth embodiment of this invention, homeologous recombination 
is enhanced between different genomes, chromosomes or genes in plant cells or plants by 
10 altering MMR in said plant cells or plants. Such genomes, chromosomes or genes are 
characterised in that they originate from different plant families, genera, species, 
subspecies, plant varieties or lines. Hybrid plant cells or hybrid plants may be produced 
by crossing, by cell fusion or by other techniques known in the art. These plant cells or 
plants are further characterised by expressing one or more genes that are capable of 
15 altering mismatch repair in the plant cell or plants. 

In the process of the ninth embodiment, the homeologous recombination is typically 
for the purpose of introducing a desired characteristic in the hybrid plant. In this typical 
application of the process of the ninth embodiment, and in the process of the tenth 
embodiment the desired characteristic may be any characteristic which is of value to the 
20 plant breeder. Examples of such characteristics are well known in the art and include 
altered composition or quality of leaf or seed derived storage products (e.g. oil, starch, 
protein), altered composition or quality of cell walls (e.g. decrease in lignin content), 
altered growth rate, prolonged flowering, increased plant yield or grain yield, altered 
plant morphology, resistence to pathogens, tolerance to or improved performance under 
25 environmental stresses of various kinds, etc. 

In a preferred form of the tenth embodiment, an MMR altering gene is co- 
introduced along with the homeologous genome, chromosome or gene of another plant 
cell or plant into an MMR proficient plant cell or MMR proficient plant to produce a 
hybrid plant cell or hybrid plant in which homeologous recombination can occur. 
30 Suitably, the MMR proficient plant cell or MMR proficient plant may also include an 
MMR altering gene. For example a gene capable of inactivating plant MSH3 may be co- 
introduced along with the' homeologous genome, chromosome or gene of another plant 
cell or plant into an MMR proficient plant cell or MMR proficient plant in which MSH6 
is inactivated. A resultant hybrid plant in which homeologous recombination occurs will 
35 include both the MSH? and MSH6 altering genes and its MMR system will therefore be 
inactivated. 

In this form of the invention, if hybrid plants are to be produced by crossing, the 
MMR altering gene preferably originates from the male parent, thus ensuring that the 
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MMR altering gene is always introduced and is not present in the recipient cell. That is, 
the MMR of the recipient cell, prior to introduction of the MMR altering gene, is 
typically proficient. Alternatively, if an MMR altering gene is present in a recipient cell it 
may be ineffective or inefficient on its own. or it may be linked to an inducible or tissue 
5 specific or cell type specific promoter which only renders the MMR altering gene active 
under limited conditions. 

Thus, in a preferred form of the process of the ninth embodiment, the MMR system 
of the hybrid plant is initially unaltered. In this form of the process, the step of altering 
the mismatch repair system may comprise introducing into the hybrid plant, or cells 
10 thereof, a MMR altering gene, such as by Agrobacterium tumefaciens or A. rhizogenes 
mediated gene transfer, ballistic and chemical methods, and electroporation of protoplasts. 

The MMR altering gene or genes are typically expressed from suitable promoters, as 
described above. Preferably, the promoter is transcriptionally active in mitotically and 
meiotically active tissue and/or cells to ensure MMR alteration after chromosome pairing 

is at mitosis and meiosis, respectively. The preferred timing for MMR alteration is at 
meiosis. because recombinant genomes, chromosomes or genes are directly transmitted to 
the progeny. A suitable meiocyte specific promoter is for example the DMCl promoter 
from Arabidopsis thaliana ssp. Ler. (Klimyuk and Jones, 1997, Plant J. 11, 1-14). 
However; mitotic homeologous recombination is also a desirable outcome as somatic 

20 recombination events can be transmitted to offspring due to the totipotency of plant cells 
and the lack of predetermined germ cells in plants. 

If desired, the MMR altering gene or genes may later be rendered non-functional or 
ineffective, or may be removed from the hybrid plant or hybrid plant cells, in order to 
restore mismatch repair in the hybrid plant or hybrid plant cells. The MMR altering gene 

25 or genes may be inactivated by means of known gene inactivation tools as described 
herein above. 

EXAMPLES 

Example 1. Cloning of the AtMSH3 and AtMSH6 coding sequences 

Isolation of partial AtMSH3 and AtMSH6 consensus sequences 

30 Degenerate oligonucleotides UPMU (SEQ ID NO: 1 ) and DOMU (SEQ ID NO:2) 

UPMU CTGGATCCACIGGICC1AA(C/T)ATG 
DOMU CTGGATCC(A/G)TA(A/G)TGIGTI(A/G)C(A/G)AA 
were used to isolate AtA>{SH3 and AtMSH6 sequences by PCR amplification. 

Primers UPMU ahd DOMU correspond to conserved amino acid sequences of the 
35 proteins MutS (£. coli and S. typhimurium), HexA (S. pneumoniae). Repl (mouse) and 
Duel (human). The conserved regions to which they are targeted are TGPNM for UPMU 
^minn acid oositions 852-856 for AtMSH6 and 816-820 for AtMSH3) FATHY or FVTHY 
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for DOMU famine acid positions 964-968 tor AtMSH6 and 928-932 for AtMSH3, 
respectively.) These primers have been used to isolate MSH2 and MSHI from yeast 
(Reenan and Kolodner. Genetics 132: 963-973 ( 1992)) and MSH2 from Xenopus and mouse 
{ Varlet et aL Nuc. Acids Res. 22:5723-5728 (1994)). 
5 Template single strand cDNA was produced by reverse transcription of 2 jag total 
RNA from a cell suspension culture of Arabidopsis thaliana ecotype Columbia (Axelos et 
al. 1989, MoL Gen. Genetics 219: 106-112). The PCR reaction was performed under the 
following conditions in a final volume of 100^1: 0.2mM dNTP, t|iM each primer, 1XPCR 
buffer, lu Tag DNA polymerase (Appligene) in the presence of template cDNA. PCR 
10 parameters were 5 minutes at 94°C, followed by 30 cycles of 40 seconds at 95°C, 90 
seconds at 45°C, 1 minute at 72°C. The amplification products were cloned into pGEM-T 
vector (Promega) and sequenced. Two different clones were isolated, S5 (350bp) was 
homologous to MSH3, S8 (327bp) was homologous to MSH6. Complete cDNA sequences 
were then isolated according to the Marathon cDNA amplification kit procedure (Clontech). 
is In summarv. this procedure involves producing double stranded cDNA by reverse 
transcription of 2|ig polyA^ RNA from the cell suspension culture of Arabidopsis. 
Adaptors are ligated on each side of the cDNA. The ligated cDNA is used as a template for 
5* and y RACE PCR reactions in the presence of primers that are specific for the adaptor on 
one side (API and AP2), and specific for the targeted gene on the other side. A 5 1 and a 3' 
20 fragment that overlap are thus produced for each gene. The complete gene coding sequence 
can be reconstituted taking advantage of a unique restriction site, if available, in the 
overlapping region. Specific details of this procedure as it was used to isolate AiMSH3 and 
A(MSH6 coding regions, are as follows. 

Isolation of AtMSH3 complete coding sequence 

25 From the sequence of clone S5, primer 636 (SEQ ID NO:3) was designed: 
636 TGCTAGTGCCTCTTGCAAGCTCAT. 
Primer API (SEQ ID NO:4) is complementary to a portion of an adaptor sequence 
which had been ligated to the 5' and 3' ends of Arabidopsis cDNA: 

AP 1 CCATCCTAATACGACTCACTATAGGGC. 
30 PCR performed on the ligated cDNA with primers 636 and API for the 5' RACE PCR 
was followed by a second round of amplification with the nested primers AP2 (SEQ ID 
NO:5) and S525 (SEQ ID ^0:6) 

AP2 ACTCACTATAGGGCTCGAGCGGC 
S525 AGGTTCTGATTATGTGTGACGCTTTACTTA 
35 (the latter was also designed to correspond to a part of the sequence of clone S5) and 
produced a 2720bp DNA fragment. Figure I provides a diagrammatic representation of the 
primer sequences used to isolate AtMSH3. Another primer (S5 1, SEQ ID NO:7) 
S5 1 GGATCGGGTACTGGGTTTTGAGTGTGAGG 
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was designed closer to the 5" border and permitted the determination of 99bp upstream to 
the A T(i initiation eodon. For the 3' RACE PCR. a first PCR reaction was performed with 
primers API and 635 (SEQ ID N():8). 

635 GCACGTGCTTGATGGTGTTTTCAC 
5 followed by a second round of amplification, using the nested primers AP2 and S523 (SEQ 
IDN0:9) 

S523 TCAGACAGTATCCAGCATGGCAGAAGTA 
which produced a DNA fragment of 890bp. Both DNA fragments were subcloned into 
pGEM-T and sequenced. Since PCR amplification using the Expand Long Template PCR 
io System (Boehringer-Mannheim) produced errors in the sequence, new oligonucleotides 
were designed to isolate those sequences again by PCR, but with the high fidelity DNA 
polymerase Pfu. PCR with primers 1S5 (SEQ ID NO: 10) and S53 (SEQ ID NO:l 1) 
1 S5 ATCCCGGGATGGGCAAGCAAAAGCAGCAGACGA 
S53 GACAAAGAGCGAAATGAGGCCCCTTGG 
15 amplified the 1244bp fragment clone 52 (SEQ ID NO: 12. cloned into pUC18/Smal). PCR 
with primers S52 (SEQ ID NO:13) and 2S5 (SEQ ID NO: 14) 

2S5 ATCCCGGGTCAAAATGAACAAGTTGGTTTTAGTC 
S52 GCCACATCTGACTGTTCAAGCCCTCGC 
amplified the 2104bp clone 13 (SEQ ID NO:15, cloned into pUC18/Smal). The complete 
20 coding sequence of the AtMSH3 gene was reconstructed in pUC18 by ligating the 5' half 
of AtMSH3 (clone 52) to the 3" half of AtMSH3 (clone 13) after digesting with BamUl 
which has a unique cleavage site in the overlapping region of both clones. This 
manipulation yielded plasmid pPF26. The Sma\ fragment from pPF26 contains the 
complete AtMSH3 coding sequence. The remaining primers referred to in Figure 1 are as 
25 follows: 

S5 1 GGATCGGGTACTGGGTTTTGAGTGTGAGG (SEQ ID NO:16) 

S525 AGGTTCTGATTATGTGTG ACGCTTTACTTA (SEQ ID NO: 1 7) 
Figures 2 and 3 provide plasmid maps of clones 52 and 13 respectively, showing 
restriction enzyme cleavage sites. The complete AtMSH3 coding sequence (SEQ ID NO: 18) 
30 is 3246bp long and is shown in Figure 4 together with the deduced sequence (SEQ ID 
NO: 19) of the encoded polypeptide. AtMSH3 is clearly homologous to the yeast and mouse 
MSH3 genes. A sequence alignment of polypeptides encoded by AtMSH3 and that encoded 
by Saccharomyces cerevisiae MSH3 is set out in Figure 5. 

Isolation of the AtMSH6 complete coding sequence and genomic sequences 

35 The same procedure allowed isolation of the AtMSH6 cDNA. Figure 6 provides a 
diagrammatic representation of the primer sequences used to isolate AtMSH6. For the 5 
RACE PCR, primers 638 (SEQ ID NO:20) and API (SEQ ID NO:4) 

638 TCTCTACCAGGTGACGAAAAACCG 
allowed the amplification of a 2889 DNA fragment. Primer S81 (SEQ IDNO:21) 
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S8 1 CGTCGCCTTTAGCATCCCCTTCCTTC AC 
helped define the 142bp upstream to the ATG initiation codon. On the 3' side. RACE PGR 
was initially performed with primers S823 (SEQ ID NO:22) and API (SEQ ID NO:4) , 

S823 GCTTGGCGCATCTAATAGAATCATGACAGG 
5 and then with the nested primers 637 (SEQ ID N0.23) and AP2 (SEQ ID NO: 5). 

637 GACAGCGTCAGTTCTTCAGAATGC 
to produce a 774bp DNA fragment. As for AtMSH3, those fragments were cloned and 
sequenced. Re-isolation of the DNA sequence using the high fidelity Pfu polymerase and 
newly designed primers 1S8 (SEQ ID NO:24) and S83 (SEQ ID NO:25) (for the 5' side) led 
io to a 2182 bp DNA fragment identified as clone 43 (SEQ ID NO:26, cloned in 
pUCl8/Smal), and a 1379bp clone identified as clone 62 (SEQ ID NO:27, also cloned in 
pUC18/Smal). 

1 S8 ATCCCGGGATGCAGCGCCAGAGATCGATTTTGT 

2S8 ATCCCGGGTTATTTGGGAACACAGTAAGAGGATT (SEQ ID 

is NO:28) 

582 GCGTTCGATCATCAGCCTCTGTGTTGC (SEQ ID NO:29) 

583 CGCTATCTATGGCTGCTTCGAATTGAG 

Figures 7 and 8 provide plasmid maps of clones 43 and 62 respectively, showing restriction 
enzyme cleavage sites. Clones 43 and 62 were digested by the Xmn\ restriction enzyme for 

20 which a unique site is present in their overlapping region and then ligated. The complete 
A(MSH6 coding sequence (SEQ ID NO:30) is 3330bp long and is shown in Figure 9 
together with the deduced sequence (SEQ ID N0:31) of the encoded polypeptide. AtMSH6 
is clearly homologous to the yeast and mouse .WSV/dgenes. A sequence alignment of 
polypeptides encoded by AtMSH6 and that encoded by Saccharomyces cerevisiae MSH6 is 

25 set out in Figure 10. 

An AtMSH6 genomic sequence was also isolated from a genomic DNA library 
constituted after partial Sau3Al digestion of DNA from the Arabidopsis cell suspension. 
8062bp were sequenced that covered the At\lSH6 gene and show colinearity with the 
cDNA. 16 introns are found scattered along the gene. The complete genomic sequence 

30 (SEQ ID NO:98) is shown in Figure 1 1 . 

Example 2. A measure of somatic variation in MMR deficient plants 
Constructs f 

Constructs with antisense A(MSH3 or antisense AtMSH6 or both AtMSH3/AtMSH6 
under the control of a single 35S promoter have been inserted into the binary vector 
35 pPZP12l (Hajdukiewicz et al.. Plant Mol. Biol. 23. 793-799) between the right and left 
borders of the T-DNA. The pPZP121 plasmid confers chloramphenicol resistance to 
Escherichia coli or Agrobacterium tumefaciens bacteria. The aacCI gene is carried by the 
T-DNA and allows selection of transformed plant cells on gentamycin (Hajdukiewicz et al., 
Plant Mol. Biol. 25, 989-994). For the purpose of expressing antisense constructs, a 35S 
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promoter/terminator cassette with a polylinker was introduced into pPZPl 21 . The 3' ends of 
the considered genes have been chosen since this region seems more efficient for antisense 
inhibition. For AtMSH3 this corresponds to clone 13 (2104bp). for A1MSH6 this 
corresponds to clone 62 (1379bp). Clone 13 comprises 21 04bp of the 3' region that were cut 

5 off the pUCl 8 vector by Sall/Sstl restriction, blunted with T4 DNA polymerase and ligated 
into the T4 DNA polymerase blunted BamWl site of pPZP12l/35S, creating clone pPF13. 
The same procedure was followed for the 3* region of AtMSH6 clone 62 (1379bp) thus 
creating plasmid pPFU. For the double constructs, the 3" region (from clone 62) of 
AtMSHS was introduced ahead of the AtMSH3 region into pPF13 creating pCW186 and 

10 reciprocally, the 3' region of AtMSH3 (from clone 13) was introduced ahead of AtMSH6 into 
pPF14, creating pCW187. 

These constructs were introduced into the Arabidopsis cells (as described below) of 
wildtype Columbia and of the Columbia tester line. 

An alternative strategy to antisense inhibition of AtMSH6 comes from experiments of 
15 Marra et al. (1998. Proc. Natl. Acad. Sci USA 95. 8568-8573) who show that 
overexpression of functional MSH3 results in depletion of MSH6 protein in human cells. 
This depletion may generate a mismatch repair mutant phenotype. 

For the purpose of overexpressing functional AiMSH3 protein in plant cells, the 
complete MSH3 coding region was excised from pPF26 (example 1) by digestion with 

20 Smal, and was inserted into the Sma\ site of pCW164. The resulting construct was named 
pPF66. It contains a complete AtMSH3 gene under the control of the 35S promoter inside 
the left (LB) and right (RB) border of the T-DNA. This T-DNA also contains the hp(2 gene 
for gentamycin selection. Plasmid pPF66 was introduced into Arabidopsis cells as 
described below. One cell clone was selected which clearly overexpressed the AtMSH3 

25 gene as shown by Northern analysis. Figures 12-16 provide plasmid maps of plasmids 
pPF13, pPF14, pCW186, pCW187 and pPF66. respectively. 

Construction of tester construct 

For the purpose of Forward Mutagenesis Assays, a tester construct was built 
containing the coding regions for nptll, codA, uidA. All three genes are driven by the 35S 

30 promoter and are terminated by the 35S terminator. This construct was obtained by 
introducing an EcoRl fragment encoding the codA cassette (2.5kb) and a Hindlll fragment 
encoding the uidA (GUS) cassette (2.4kb) into the pPZPlll vector (Hajdukiewicz et 
al.,1994, Plant Mol Biol 23: 793-799) which already contained the nptll expression cassette. 
This new plasmid was named pPF57. Nptll is used to select for transformed plant cells. 

35 GUS is used to analyse; the degree of gene silencing in the construct (i.e. to identify cell 
lines in which the transgenes are expressed), and codA is used as a marker for forward 
mutagenesis (described below). 
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The plasmid map of pPF57 is provided in Figure 17. 

Plant celt transformation 

The constructs are introduced into Agrohucterium by electroporation. Plant cells are 
then transformed by co-cultivation. A suspension culture of Arabidopsis thaliana cells that 

s has been established by Axelos et al. (1992, Plant Physiol. Biochem. 30, 1-6) may be used. 
One day old freshly subcultured cells are diluted five times into AT medium (Gamborg B5 
medium, 30g/l sucrose, 200ng/l NAA). 10|il of saturated Agrobacterium containing the 
transforming T-DNA constructs are added to 10ml diluted cells in a 100m! erlenmeyer. The 
co-cultivation is agitated slowly (80rpm) for 2 days. The cells are then washed 3 times into 

10 AT medium and finally resuspended in the same initial volume (10ml). The culture is 
agitated for 3 days to allow expression before plating on selection plates (AT/BactoAgar 
8g/l+gentamycin 50|ag/ml). Transformed individual calli are isolated 3 weeks later. 

Tester Strain 

The tester construct on plasmid pPF57 was introduced into Arabidopsis cells of 
15 wildtvpe Columbia using the transformation protocol described above. Among 10 candidate 
transformams. one cell clone was shown (by Southern analysis) to have a unique T-DNA 
insertion. All three genes were shown to be functional in this cell line as indicated by 
resistance to kanamycin, blue staining in the presence of X-Glu (GUS), and sensitivity to 
5-fluoro-cytosine (codA). 
20 MMR altering genes (described above) were then introduced individually into the 

tester line and transformed cells are used for analysis of both Microsatellite Instability and 
Forward Mutagenesis. 

Microsatellite analysis 

Microsatellites have been described in Arabidopsis (Bell and Ecker, 1994, Genomics 
25 19.1 37-144). The present Example is based on a study of instability of microsatellites that 

are polymorphic amongst different ecotypes. DNA is extracted from the transformed calli. 

Specific primers have been defined that are used to amplify the microsatellite sequence. 

One of the two primers is previously P 32 labelled by T4 kinase. In case of a polymorphic 

variation, new PCR products appear that do not follow the expected pattern of migration on 
30 a polyacrylarnide gel. This is a commonly observed feature for MMR deficient cells in 

yeast or mammalian cells. 

In particular, the present Example describes a study on microsatellites ca72 (CA|g)> 
ngal72 (GA 29 ). and ATHGENEA(A 39 ), chosen because they belong to the types 
predominantly affected in human mismatch repair deficient tumors. The size of these 
35 microsatellites is not conserved from one Arabidopsis ecotype to the other. 

Arabidopsis cells which are transformed with an MMR altering gene (above) and 
control cells not expressing the MMR altering gene are allowed to form calli. DNA is 
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rapidly extracted from the calli and is analysed for mierosatelliie instability as described in 
detail by Bell and Ecker I c )94. Genomics 19. 137*144. In summary, the relevant 
microsatellite is amplified by PGR using* P32 labelled primers. The PGR products are 
separated on a DNA sequencing gel for size determination. Size differences between 
5 microsatellites from transformed and control cells not expressing the MMR altering gene in 
question indicate microsatellite instability as a result of MMR alteration. 

The sequences of primers used for PCR amplification of microsatellites ca72 and 
ngal72 are included in Table 1 . PCR amplification of microsatellite ATHGENEA made use 
of a forward primer containing the sequence 

io ACCATGCATAGCTTAAACTTCTTG (SEQ ID NO:32) 

and of a reverse primer containing the sequence 

ACATAACCACAAATAGGGGTGC (SEQ ID NO:33). 

The amplification for microsatellite ca72 revealed in Columbia control cells (with 
respect to the MMR altering gene) a 248 bp long PCR fragment instead of the published 
15 length of 124 bp. DNA sequencing verified this fragment as a CA iH microsatellite. 

Forward mutagenesis assav 

Tester cells transformed with antisense AtMSHS or antisense A(MSH6 or both 
AlMSH3/AtMSH6 are analysed for the stability of the codA gene. The functional codA gene 
confers to sensitivity to 5-fluoro-cytosine (5FCV whereas a gene inactivating mutation in 

20 codA will confer resistance to 5FC. The frequency of resistant cells is therefore a good 
indicator of somatic variation as a direct result of MMR alteration. Variants resistant to 5FC 
are first analysed for GUS activity. If GUS is inactive, 5FC resistance is assumed to be due 
to gene silencing (all three genes are under the 35S promoter). If GUS is active, 5FC 
resistance is assumed to be due to forward mutations that have inactivated codA. PCR is 

25 then performed on the putative codA mutant genes which is then sequenced to confirm the 
presence of forward mutations in codA. 

Besides codA, other marker genes may also be used for the Forward Mutagenesis 
Assay such as the ALS gene (conferring sensitivity to valine or to sulfonylurea; Hervieu and 
Vaucheret, 1996, Mol. Gen. Genet. 251 220-224; Mazur et al. 1987, Plant Physiol. 85 1110- 
30 1117). 

Example 3, Homeologous meiotic recombination in Arabidopsis thaliana 

A. Construction of a meioevte specific gene expression cassette comprising the DMC\ 
promoter and the NOS terminator 

(i) The DMCl promoter may be used as published by Klimyuk and Jones, 1997, 
35 Plant J. 11,1-14). To obtain a more convenient alternative for gene cloning, a 3.3 Kb 
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long subfragment of the DMC\ promoter was obtained by PCR from genomic DNA of 
Arabidopsis ihaliana (ssp. Landsberg erecta "Ler"). 
The PCR was done in three rounds: 

Round One: A 3.7 Kb long product was obtained using the forward primer 
5 DMCIN-A comprising the sequence 

GAAGCGATATTGTTCGTG (SEQ ID NO: 34) 
and the reverse primer DMCIN-B comprising the sequence 

AGATTGCGAGAACATTCC (SEQ ID NO:35). 
The weak amplification product was then used as template for round two and three. 
io Round Two: A 3.1 Kb long product comprising the promoter and the 5* 
untranslated leader was obtained using forward primer DMCIN-1. which contained the 
sequence 

acgcgtcgacTCAGCTATGAGATTACTCGTG (SEQ ID NO:36) 
and introduced a Sail cloning site at the 5* end of the promoter fragment, and reverse 
i5 primer DMCIN-2 which contained the sequence 

gctctagaTTTCTCGCTCTAAGACTCTCT (SEQ ID NO:37) 
and introduced a Xbal site at the 3' end of the PCR fragment. 

Round Three: A 0.2 Kb long product comprising the first exon/intron of the DMCl 
promoter was obtained using forward primer DMCIN-3, which contained the sequence 
20 gctctagaGCTTCTCTTAAGTAAGTGATTGAT (SEQ ID NO:38) 

and introduced a Xbal site at the 5' end of the PCR fragment, and reverse primer 
DMCIN-4, containing the sequence 

tcccccgggctcgagagatctccatggTTTCTTCAGCTCTATGAATCC (SEQ ID NO:39) 
and introduced at the 3' end of the PCR product restriction sites for NcoL Bgllh Xhol and 
25 Smal. 

The products obtained in round Two and Three were digested with Xbal and 
subsequently ligated to reconstitute a 3.3 Kb long DMCl promoter from which the first 
two in-frame ATG start codons were replaced with a unique restriction site for Xbal. 
This promoter can be cloned between the restriction sites for Sail and Smal of p3264, 
30 which contains the SacI-EcoRl NOS terminator in pBIN19, to yield the entire expression 
cassette in pBIN19. This cassette contains the following cloning sites: Ncol y BgHh Xhol. 
Smal and (already present on p3264) Kpnl and Sad. 

(ii) Another strategy yielded the following convenient DMCl promoter. A 1.8 kb 
DNA fragment comprising the 3' terminal pan of the meiocyte specific DMCl promoter 
35 was isolated by PCR from purified genomic DNA of Arabidopsis thaliana (ssp. Landsberg 
erecta "Ler"). The forward PCR primer (DMCl a) contained the sequence 
acgcgtcgacGAATTCGCAAGTGGGG (SEQ ID NO:40) 
and introduced a Sail cloning site at the 5* end of the promoter fragment. The reverse 
PCR primer (DMClb) contained the sequence 
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tccatggagatctcccgggtacCGATTTGCTTCGAGGG (SEQ ID NO:41) 
introducing a polylinker region at the 3' end of the promoter fragment. The PCR reaction 
was carried out with VENT DNA Polymerase (NEB) over 25 cycles using the following 
cycling protocol: 1 minute at 94°C, I minute at 54°C, 2 minutes at 72°C. 

5 The PCR reaction yielded a blunt ended DNA fragment which was digested with 
restriction enzyme Sail and was cloned into the cleavage sites of restriction enzymes Sail 
and Smal in plasmid p2030, a pUC118 derivative containing the Sacl-EcoRl NOS 
terminator fragment of pBIN121. The cloning yielded plasmid p2031, containing the 
DMCl promoter-poly linker-NOS terminator expression cassette depicted in Figure 18. 

io B. Construction of an MSH3 antisense gene under the control of the DMCl promoter 

A 2.1 kb DNA fragment encoding the carboxy terminal part of AtMSH3 was 
removed from the polylinker of clone 13 described in Example 1 after (i) digestion with 
Kpnl, (ii) blunting of the DNA ends generated by Kpnl and (iii) digestion with BamUl. 
The isolated fragment was then cloned in antisense orientation downstream of the DMCl 
is promoter in plasmid p203L which had been digested with restriction enzymes Smal and 
BglU. This cloning yielded plasmid p2033 (Figure 18). 

After digestion of p2033 with EcoRl, a 4.1 kb DNA fragment was recovered 
comprising the DMCl promoter, the partial MSH3 cDNA in antisense orientation with 
respect to the promoter and the NOS terminator. This fragment was cloned into the EcoRI 
20 restriction site of plant transformation vector pNOS-Hyg-SCV to yield plasmid p3242 
(Figure 18). 

C. Construction of a combined MSH6/MSH3 antisense gene under the control of a 
single DMCl promoter 

A 3.1 kb fragment, encoding in antisense orientation the partial AtMSH6 and AtMSH3 
25 sequences and the 35S terminator, was isolated from pCW186 by digestion with Kpnl. 
The fragment was treated with Klenow enzyme to blunt both ends. It was then cloned into 
the Smal site of plasmid p3243 (a pNOS-Hyg-SCV derivative, illustrated in Figure 19), 
which had been opened to delete the region between the Smal sites. Clones containing the 
fragment in the antisense orientation with respect to the DMCl promoter (described in 
30 A(ii) above) were identified by diagnostic digestion with BamUl. The selected construct 
was named p3261. * 

Another practical way of cloning the double antisense gene is as follows. A 1 kb 
DNA fragment encoding the carboxyterminal part of AtMSH6 is isolated from clone 62 
described in Example \ after digestion of clone 62 plasmid DNA with SamHI, which 
35 cleaves in the 5' polylinker region flanking the partial cDNA, and with EcoRl, which 
cleaves within the cDNA. The isolated fragment is treated with Klenow enzyme to blunt 
both its ends and is cloned into the recipient plasmid p2033 or p3242. For the purpose of 
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cloning, the recipient plasmid may be cleaved with either Aval or Ncol and can be blunted 
with Klenow enzyme to produce blunt acceptor ends for fragment cloning. This cloning 
yields two opposite orientations of cloned fragment DNA with respect to the DMCl 
promoter. These can be identified by diagnostic digestion with restriction enzymes Seal 

5 or Xmnl in conjunction with Sad, The selected construct contains the DMCl promoter, 
the combined partial cDNAs for AtMSH3 and AtMSH6 (both cloned in antisense 
orientation with respect to the DMCl promoter) and the NOS terminator. If the recipient 
plasmid is p2033, the combined antisense gene under control the single DMCl promoter is 
recovered from the vector after EcoRI digestion and is cloned into the EcoRl restriction 

io site of pNOS-Hyg-SCV. 

D. Construction of a full-leneth MSH3 sense gene under control of the DMCl promoter 
for overexpression of functional MSH3 protein 

Overexpression of MSH3 protein was shown in human cells (Marra et al., 1998/ 
Proc. Natl. Acad. Sci. USA 95, 8568-8573) to complex all available MSH2 protein. This 
is leaves MSH6 protein without its partner, leading to the degradation of MSH6 protein and 
eventually to a mismatch repair phenotype. 

This phenomenon is exploited to increase homeologous meiotic recombination in 
Arabidopsis as an alternative to antisense inhibition of A1MSH6. For this purpose the full- 
length cDNA encoding AtMSH3 is isolated from plasmid pPF66 by digestion with Smal 
20 and is cloned into the Smal site of the DMCl expression cassettes described in A(i). 

E. Selection of Recombination markers on homeologous chromosomes of Arabidopsis 
thaliana subspecies Landsberz erecia (Ler). Columbia (Col) and C24, respectively 

E(i). Visual recombination markers in Arabidopsis th. subspecies C24: 

Agrobacterium mediated transformation with a T-DNA containing a 35S-GUS gene, 
25 inactivated by insertion of a 35S-Ac transposable element (Finnegan et al., 1993, Plant 
MoL Biol. 22, 625-633), had yielded a C24 line in which the T-DNA construct was 
integrated into chromosome 2. Genetic and molecular analysis of this line shows that the 
Ac transposon had excised from its T-DNA locus thereby restoring GUS activity, but had 
re-inserted into the chromosome at a distance of 16.4 cM, where it stayed fixed (due to 
30 disablement of Ac) within the chlorina gene. Insertional inactivation of the chlorina gene 
caused a bleached phenotype in those plants that were homozygous for this mutation. 
Because of the two linked phenotypic markers, chlorina3A;Ac and GUS, this C24 line was 
used in crosses to wildtype Ler for analysts of meiotic homeologous recombination on 

chromosome 2 in conjunction with molecular recombination markers. 

v 

35 E(ii). Visual recombination markers in Arabidopsis th. Ler: 

The Ler line NW1 (obtained from NASC, Nottingham, UK) contains one recessive 
visual marker per chromosome, i.e. an-1 on Chr.l, py-1 on Chr.2, gll-1 on Chr.3, cer2-l 
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on Chr.4. and msl-1 on Chr.5. This line is used in crosses to wildtype C24 which 
expresses an MMR altering gene tor analysis of meiotic homeologous recombination on 
chromosomes 1-5 in conjunction with molecular recombination markers listed in Table 1. 

Other Ler lines from NASC have several visual markers in close proximity to each 
5 other on the same chromosome. When these lines are used for hybrid production, analysis 
of homeologous meiotic recombination may make use entirely of visual recombination 
markers. Particularly suitable for crossing to C24 wildtype that is expressing a MMR 
altering gene are the following Ler lines: 

NW22: relative markers are disl - (4 cM) - ga4 - (11 cM) - thl on chromosome 1. 
io NW10: relevant markers are tz~201 - (5 cM) - cer3 on chromosome 5. 
NW134, relevant markers are ttg - (4 cM) - ga3 on chromosome 5. 

NW24 (abi3-l) and NW64 (gllA). When present in the same plant on chromosome 
3, abi3A and g/M are 13 cM apart. Since this marker combination is not available from 
NASC, we have combined these markers by crossing line NW24 to line NW64. The Fl 
15 offspring were allowed to self-fertilise and to produce F2 seeds. F2 Plants which carry 
both markers as homozygous traits on the same chromosome can be identified firstly, by 
germinating F2 seeds on germination medium containing selective concentrations of 
abscisic acid, and subsequently, by identifying among the abscisic acid resistant plants 
those individuals which show the glabra phenotype. 

20 E(iii) Molecular recombination markers in CoL Ler and C24: 

The genome of Arabidopsis thaliana is interspersed with unique base sequences 
arranged as simple tandem repeats. Allelic repeats can vary in length between different 
Arabidopsis subspecies and when amplified by PCR yield diagnostic DNA products of 
different length named Simple Sequence Length Polymorphisms (SSLPs). Many SSLPs 
25 have been genetically mapped and have been assigned to unique chromosome locations on 
the recombinant inbred map (Bell and Ecker. 1994, Genomics 19, 137-144; Lister and 
Deans lines, Weeds World 4i, May 1997). 

In Table 1 are listed 28 mapped and established SSLPs between Ler and CoL A 
number of PCR primer pairs are described herein (SEQ ID NO:42 to SEQ ID NO:97) 
30 which also yielded SSLPs between C24 and Ler (19 SSLPs) or between C24 and Col (25 
SSLPs), respectively. Polymorphic SSLPs can be used as molecular markers in the 
analysis of homeologous recombination between genomes from these subspecies. 

The PCR reactions (25 |aL) were carried out over 35 cycles (15 seconds at 94°C, 30 
seconds at 55°C and io seconds at 72°C), with 0.25 U Taq DNA polymerase and 0.6 fig 
35 genomic DNA in reaction buffer containing 2 mM MgC^. PCR products were separated 
by agarose gel electrophoresis (4% ultra high resolution agarose) and visualised by 
ethidiumbromide staining. The results from the PCR experiments are summarised in 
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Table 1. which also shows the sequence of PCR primers, primer annealing temperature 

(Tm). PCR product length and chromosome location of SSLP (with respect to the RI map 

of May 1997, Weeds World 4i). 

F. Production of hybrid plants 
5 C24 plants heterozygous for chlorina3A:Ac/GUS are crossed as male to emasculated 

wildtype Ler to produce Ler/C24(chlorina3A, GUS) hybrid seeds. 

Due to the heterozygosity of the C24 parent, only 50 % of hybrid plants have 

inherited the chlorina3A:Ac/GUS locus. The remaining 50% of hybrid plants are wildtype 

with respect to chlorina3A:Ac/GUS. Since the mutant locus is linked to a kanamycin 
io resistance gene (contained on the same T-DNA as GUS) mutant plants can be pre-selected 

by germinating hybrid seeds on germination medium containing 50 mg/L kanamycin. 

Ler plants homozygous for the five chromosome markers are male sterile (msl-1) 
and are crossed without emasculation to wildtype C24 to produce Ler{an-1, py-1, gll-1, 
cer2-l, msl-l)IC24 hybrid seeds. 
15 Other Ler plants, which are male fertile, are crossed after emasculation of the female 
parent to produce Ler/C24 hybrid seeds. 

r, Introduction of MSH?> and MSH6/3 antisense genes in to Arabidopsis and analysis of 
meiotic homeologous recombination 

(i) Transformation of hybrid plants and analysis of home ologous meiotic recombination 

20 The plant transformation vectors comprising the antisense genes described in (B) and 
(C) above are introduced into Agrobacterium mmefaciens strain AGLl (Lazo et al.. 1991, 
Bio/Technology 9, 963-967) by electroporation. Recombinant Agrobacterium clones are 
selected on LB medium containing 50 mg/L rifampicin and 100 mg/L carbenicillin. 
Selected clones are used to infect roots of Arabidopsis hybrid plants (described in (F) 

25 above) using the root transformation protocol of Valvekens et al. (1988, PNAS 85. 5536- 
5540) except that shoot and root inducing media contain hygromycin (10 mg/L) instead of 
kanamycin. 

Plants regenerated from roots of hybrid plants are genetic clones of root donating 
plants and therefore are again genetic hybrids of two Arabidopsis subspecies described in 
30 (F). However, in contrast to the root donating plants, the regenerated hybrid plants also 
contain the introduced transgene and the co-introduced hygromycin resistance gene with 
the latter allowing these plants to grow on hygromycin containing culture medium. 

Hygromycin resistant plants are then allowed to enter the reproductive phase and to 
produce gametes by meiotic divisions of microspore and megaspore mothercells. At 
35 meiosis, the DMC\ promoter is activated and can direct the expression of antisense genes 
described in (B) and (C) above, leading to decreased MSH6 and/or MSH3 gene 
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expression. This in turn depletes the gamete mothercells of MSH6 and/or MSH3 protein, 
thus causing alteration of MMR during meiotic divisions and increasing the recombination 
frequency between homeologous chromosomes. 

Transgenic plants are then allowed to self-fertilise and to produce seeds. These 
5 seeds (F2 seeds with respect to hybrid production), and the plants derived therefrom, 
carry the homeologous recombination events which can be identified by using the visual 
and molecular recombination markers described in (E) above. 

In case of homeologous recombination between chromosomes of Ler and 
C24{chlorina3A:Ac, GC/S), the analysis concentrates on chromosome 2 by selecting plants 

io showing the visual phenotypic marker chlorine This marker thus serves as a reference 
point as it indicates that respective chromosomes 2 originate from C24. Other markers, 
such as GUS or molecular markers, on chromosome 2 may then be used to identify 
chromosomal regions which are derived from the Ler chromosome as a result of 
homeologous recombination. F2 plants of control transformants not expressing the 

15 antisense gene(s) can be analysed in parallel and the results can be used for comparison to 
homeologous recombination results obtained in antisense plants. 

(ii) Transformation of C24 wildtvpe, hvbrid plant production and analysis of homeologous 
meiotic recombination 

Introduction of MMR altering genes into wildtype C24 is done using the root 
20 transformation protocol as described in G(i) for transformation of hybrid plants. 
Transformed plants are selected by resistance to either 10 mg/L hygromycin (in case of 
transformation with T-DNA's derived from pNOS-Hyg-SCV) or to 50 mg/L kanamycin 
(in case of transformation with T-DNA's derived from pBIN19). 

Transgenic plants are then allowed to self-fertilise and to produce seeds (Tl seeds). 

25 Segregation of the antibiotic resistance gene in the Tl population then indicates the 
number of transgene loci. Lines with a single transgene locus (indicated by a 3:1 ratio of 
resistant:sensitive plants) are selected and are bred to homozygosity. This is done by 
collecting selfed seeds (T2) from Tl plants and subsequent testing of at least four 
independent T2 seed populations for segregation of the antibiotic resistance gene. T2 

30 populations which do not segregate the antibiotic resistance gene are assumed to be 
homozygous for both the resistance gene and the linked MMR altering gene. 

C24 plants homozygous for the MMR altering gene are then crossed to Ler lines 
homozygous for recessive visual markers (see E(ii)) to produce C24/Ler hybrid plants as 
described in (F). These Fl hybrids are then allowed to enter the reproductive phase and to 
35 produce gametes by meiotic division of microspore and megaspore mothercells. At 
meiosis, the DMC 1 promoter is activated and can direct the expression of antisense or 
sense genes described in (B), (C) and (D) above, leading to decreased MSH6 and/or MSH3 
gene expression. This in turn depletes the gamete mothercells of MSH6 and/or MSH3 
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protein, thus causing alteration of MMR during meiotic divisions and increasing the 
recombination frequency between the homeologous chromosomes of C24 and Ler. 
Recombination events are then scored in the F2 generation. 

For recombination analysis, the hybrid plants are allowed to self- fertilise and to 
produce F2 seeds. F2 plants are then preselected for a first visual marker. Since this 
marker is recessive, its visual presence indicates homozygosity for Ler DNA at the 
relevant locus. Those F2 plants which show this first visual marker are then analysed for 
the presence or absence of a second visual marker which in the Ler parent is closely linked 
to the first marker. Absence of the second visual marker indicates recombination between 
the relevant C24 and Ler chromosomes between the first and second marker. The 
frequency of recombination in transgenic hybrids is compared to the recombination 
frequency in control hybrids not expressing the MMR altering gene. 

Examples of recombination analysis are the following. 

The Ler line NW22(disL ga4 . thl) is used for crosses to transformedC24. 

F2 plants are preselected first for thiamine requirement (thl) and then are further 
analysed for re-appearance of wildtype height (loss of ga4) and/or re-appearance of 
normal trichomes (loss of disl) as a result of recombination. 

The Ler line NW10(rz-20/, cer3 ) is used for crosses to transformedC24. 

F2 plants are then preselected first for thiazole requirement (tz) and then are further 
analysed for re-appearance of normal, i.e. non-shiny stems (loss of cer3) as a result of 
recombination. 

The Ler line NW134 (ng , ga3 ) is used for crosses to transformedC24. F2 plants 
are first preselected for dwarfish appearance (ga3) and are then analysed for re-appearance 
of trichomes (loss of ng) as a result of recombination. 

Ler plants homozygous for abi3-\ and g/7-1 are used for crosses to transformed C24. 
F2 plants are first preselected for their ability to germinate in the presence of abscisic acid 
and are then analysed for re -appearance of trichomes on the leaves (loss of gll-l) as a 
result of recombination. 

In the case of homeologous recombination between transformed C24 and the Ler line 
NW1 (an-l y py-1, gll-l,, cer2-\, msl-\), recombination analysis is similar the one 
described above, except that the second marker is not a visual marker but has to be a 
molecular marker. This is because the Ler parent carries only one visual marker per 
chromosome. 
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CLAIMS 

I. An isolated and purified DNA molecule comprising a polynucleotide sequence 
encoding a polypeptide functionally involved in the DNA mismatch repair system of a 
plant. 

5 2. A DNA molecule according to claim 1 wherein said polypeptide is 
homologous to a mismatch repair polypeptide of a yeast or of a human. 

3. A DNA molecule according to claim 1 wherein said polypeptide is 
homologous to AtMSH3 (SEQ ID NO: 19) or to AtMSH6 (SEQ ID NO: 31). 

4. An isolated and purified polypeptide functionally involved in the DNA 
io mismatch repair system of a plant. 

5. A polypeptide according to claim 4 which is homologous to a mismatch repair 
polypeptide of a yeast or of a human. 

6. An isolated and purified polypeptide selected from the group consisting of a 
polypeptide encoded by the gene AtMSH3 (SEQ ID NO: 18), a polypeptide encoded by the 

15 gene AtMSH6 (SEQ ID NO:30), polypeptides homologous to a polypeptide encoded by the 
gene AtMSH3 (SEQ ID NO: 18) and polypeptides homologous to a polypeptide encoded 
by the gene AtMSH6 (SEQ ID NO.30). 

7. An isolated and purified DNA molecule comprising a polynucleotide sequence 
selected from the group consisting of (i) a sequence encoding a polynucleotide which is 

20 capable of interfering with the expression of a plant polynucleotide sequence encoding a 
polypeptide which is homologous to a mismatch repair polypeptide of a yeast or of a 
human and thereby disabling said plant polynucleotide sequence; and (ii) a sequence 
encoding a polypeptide capable of disrupting the DNA mismatch repair system of a plant. 

8. A DNA molecule according to claim 7 comprising a polynucleotide sequence 
25 encoding a polynucleotide capable of interfering with the expression of a plant 

polynucleotide sequence encoding a polypeptide which is homologous to a mismatch 
repair polypeptide of a yeast or of a human and thereby disabling said plant polynucleotide 
sequence. 

9. A DNA molecule according to claim 8 wherein said polynucleotide is capable 
30 of interfering with the expression of a plant polynucleotide sequence is a sense 

polynucleotide, an antisense polynucleotide or a ribozyme. 

10. A DNA molecule according to claim 7 comprising a polynucleotide sequence 
encoding a polypeptide capable of disrupting the DNA mismatch repair system of a plant. 
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11. A DNA molecule according to claim 10 wherein said polypeptide is 
homologous to AtMSH3 (SEQ ID NO: 19) or to AtMSH6 (SEQ ID NO: 31). 

12. A DNA molecule according to claim 10 further comprising a regulation 
element capable of causing overexpression of said polypeptide in a cell of said plant. 

s 13. A chimeric gene comprising: 

a DNA sequence selected from the group consisting of (i) a sequence encoding a 
polynucleotide capable of interfering with the expression of a plant polynucleotide 
sequence encoding a polypeptide which is homologous to a mismatch repair polypeptide of 
a yeast or of a human and thereby disabling said plant polynucleotide sequence, and (ii) a 
io sequence encoding a polypeptide capable of disrupting the DNA mismatch repair system 
of a plant; and 

at least one regulation element capable of functioning in a plant cell. 

14. A chimeric gene according to claim 13 wherein said regulation element is 
selected from constitutive, inducible, tissue type specific and cell type specific promoters. 

is 15. A chimeric gene according to claim 13 comprising a DNA sequence encoding 
a polypeptide capable of disrupting the DNA mismatch repair system of a plant, wherein 
said regulation element is capable of causing overexpression of said polypeptide in a cell 
of said plant. 

16. A chimeric gene according to claim 13 wherein said regulation element is 
20 selected from the group consisting of 35S, NOS, PRla, AoPRl and DMC1. 

17. A plasmid or vector comprising a chimeric gene according to any one of 
claims 13-16. 

18. A plant cell stably transformed, transfected or electroporated with a plasmid or 
vector according to claim 17. 

25 19. A plant comprising a cell according to claim 18. 

20. A plant according to claim 19 selected from plants of the families 
Brassicaceae, Poaceae, Solanaceae, Asteraceae, Malvaceae, Fabaceae, Linaceae, 
Canabinaceae, Dauaceae and Cucurbitaceae. 

21. A process for at least partially inactivating a DNA mismatch repair system of a 
30 plant cell, comprising transforming or transfecting said plant cell with a DNA molecule 

according to any one of claims 1-3 or 7-12 and causing said DNA sequence to express 
said polynucleotide or said polypeptide. 

22. A process for at least partially inactivating a DNA mismatch repair system of a 
plant cell, comprising transforming or transfecting said plant cell with a chimeric gene 
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according to any one of claims 13-16 and causing said DNA sequence to express said 
polynucleotide or said polypeptide. 

23. A process for at least partially inactivating a DNA mismatch repair system of a 
plant cell, comprising transforming or transfecting said plant cell with a plasmid or vector 

5 according to claim 17 and causing said DNA sequence to express said polynucleotide or 
said polypeptide. 

24. A process for increasing genetic variation in a plant comprising obtaining a 
hybrid plant from a first plant and a second plant, or cells thereof, said first and second 
plants being genetically different; altering the mismatch repair system in said hybrid plant; 

io permitting said hybrid plant to self-fertilise and produce offspring plants; and screening 
said offspring plants for plants in which homeologous recombination has occurred. 

25. A process according to claim 24 wherein a first gene is incapacitated in said 
first plant, a second gene is incapacitated in said second plant, and said first and second 
genes are incapacitated in said hybrid plant thereby altering the mismatch repair system of 

15 said hybrid plant. 

25. A process according to claim 25 wherein said incapacitation of the mismatch 
repair system of said hybrid plant is reversible. 

26. A process according to claim 24 wherein a new genetic linkage of a desired 
characteristic trait or of a gene which contributes to a desired characteristic trait is 

20 observable in at least one of said offspring plants. 

27. A process for obtaining a plant having a desired characteristic, comprising 
altering the mismatch repair system in a plant, cell or plurality of cells of a plant which 
does not have said desired characteristic, permitting mutations to persist in said cells to 
produce mutated plant cells, deriving plants from said mutated plant cells, and screening 

25 said plants for a plant having said desired characteristic. 

28. A process according to claim 27 wherein said step of altering the mismatch 
repair system comprises introducing into said hybrid plant, plant, cell or cells a chimeric 
gene according to claim 13 and permitting the chimeric gene to express a polynucleotide 
which is capable of interfering with the expression of a plant polynucleotide sequence in a 

30 mismatch repair gene of the hybrid plant, plant, cell or cells, or a polypeptide capable of 
disrupting the DNA mismatch repair system of the hybrid plant, cell or cells. 

29. A process according to claim 28 comprising inactivating an MSH3 gene and/or 
an MSH6 gene of said plant. 

30. A process according to claim 28 comprising inactivating an MSH3 gene and an 
35 MSH6 gene of said plant. 
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31. A process according to claim 27 comprising at least partially inactivating the 
mismatch repair system of said plant in a predetermined cell type or in a predetermined 
tissue of said plant. 

32. A process according to claim 31 further comprising restoring mismatch repair 
in said cell type or said tissue. 

33. An oligonucleotide capable of hybridising at 45°C under standard PCR 
conditions to a DNA molecule according to claim 1 with the proviso that said 
oligonucleotide is other than SEQ ID NO: 1 or SEQ ID NO:2. 

34. An oligonucleotide capable of hybridising at 45°C under standard PCR 
conditions to the DNA of SEQ ID NO: 18 with the proviso that said oligonucleotide is 
other than SEQ ID NO: 1 or SEQ ID NO:2. 

35. An oligonucleotide capable of hybridising at 45 °C under standard PCR 
conditions to the DNA of SEQ ID NO:30 with the proviso that said oligonucleotide is 
other than SEQ ID NO:l or SEQ ID NO:2. 
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Figure 2 



Comments/References: 52= 3' side of S5 (AtMSH3) 1244bp in pUC18/Sma1 

i 
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Figure 3 



Comments/References: 13 = 3' side of S5 (AtMSH3) 2104bp in pUC18/Smal 
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Figure 7 

Comments/References: 43= 5' side of S8 (AtMSH6) 2182 bp in pUCl8/Sma1 
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Figure 8 



Comments/References: 62= 3' side of S8 (AtMSH6) 1379bp in pUCl8/Sma1 
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TTTTT7GG7TGC7AACAATAAAGGTATACGGTTTTATG7CA7CAATATAA 

CTATA7ATAAAAGAAATGAAAGATA7A7AT7GTT7TT7CATTTATCAAAC 

AAAACAACAAGACTTTTTTTTTACTTTTTACATTGGTCAACAAAATACAA 

GATAAACGACATCGTTTAATCATTTCCCAATTTTACCCCTAAGTTTAACA 

CCTAGAACCTTCTCCATCTTCGCAAGCACAGCCTGATTAGGAACAGCTTT 

ACCATTCTCATATTCCTGAACTACCTGAGTCCTCTCATTGATCTGTTTCG 

CCAAATCCGCTTGTGACATCTTCTTCTCCAATCTCGCTTTCTGTATCATC 

AACCTCACCTCTGCTTTCACACGATCCATCGCCGCAGGCTCTGTTTCTTC 

TTCCAGCTTCTTCGTGTTAATCACCGGAACCGCCGTAGATTTCCCCTTTT 

TGTTCGAACCGGCATCGAATTTCTTAACCGTTTGAACCGCGACACCGTTT 

CTCAGAGCTGCGTTAACCGCTTTCGGATCGCGTAGGTCTTGGCTCTTTTG 

TTTTGATTTGTGGAGAACTACTGGTTCCCAGTCTTGTGTTACTGCTCCTG 

GGTATCTGCTCGGCATCGTCGATGAATTGAGAGAAAGGAACAACGCGAAA 

ATTT T AT T AAT CT G AGT TTT G AAAT TG AG AAAC G ATGAAG ATG AAG AAT G 
TTGTTGAGAGGATTGTGATATTTATATATACGAAGATTGGTTTCTGGAGA 
ATTCGATCATCTTTTTCTCCATTTTCGTCTCTGGAACGTTCTTAGAGATG 
ATTGACGACGTGTCA7TATCTGATTTGCAGTTAACCAATGCTTTTTGGGT 
7GGATTCGTGGTACACCATATTATCCGAT7TGGC7CAATGGTTTTATATA 
EJVTTTGG7T7TCGGTTCGG77A.TGAGT7ATCATTAAAATTAAGCTAACCA 
AAAA7777CG7AAAATT7AT7TCGG77TCAATTCGGA7CCC77ACT7CCA 
GAACCGAATTATTCGAAACCGGGGT7AGCCGAACCGAATACCAATGCCTG 
ATTGAC7CG77GGCTAGAAAGATCCAACGGTA7ACAA7AATAGAACATAA 
ATCGGACGG7CA7CAAAGCCTCAAAGAG7GAACAGTCAACAAAAAAAGTT 
GAGCCCTGAGGAGTA7CGTTTCCGCCAT7TCTACGACGCAAGGCGAAAAT 
7TTTGGCGCCAATCTTTCCCCCCT77CGAATTCTC7CAGC7CAAAACATC 
GTTTC7C7C7CACTC7CTCTCACAA77CCAAAAAATGCAGCGCCAGAGAT 
CGA7T77GTC7T7C7TCCAAAAACCCACGGCGGCGAC7ACGAAGGGTTTG 
GT77CCGGCGA7GCTGCTAGCGGCGGGGGCGGCAGCGGAGACCACGATTT 
AA7GTGAAGGAAGGGGATGCTAAAGGCGACGCTTC7G7ACGTTTTGCTGT 
TTCGAAATCTGTCGA7GAGGTTAGAGGAACGGATACTCCACCGGAGAAGG 
TTCCGCGTCGTGTCCTGCCGTCTGGATTTAAGCCGGCTGAATCCGCCGGT 
GATGCTTCG7CCCTGTTCTCCAATATTATGCATAAGTTTGTAAAAGTCGA 
TGATCGAGATTGTTCTGGAGAGAGGTACTAATCTTCGATTCTCTTAATTT 
TGTTATCTTTAGCTGGAAGAAGAAGATTCGTGTAATTTGTTGTATTCGTT 
GGAGAGATTCTGATTACTGCATTGGATCGTTGTTTACAAATTTTCAGGAG 
CCGAGAAGATGTTGTTCCGCTGAATGATTCATCTCTATGTATGAAGGCTA 
ATGATGTTATTCCTCAATTTCGTTCCAATAATGGTAAAACTCAAGAAAGA 
AACCATGCTTTTAGTTTCAGTGGGAGAGCTGAACTTAGATCAGTAGAAGA 
TATAGGAGTAGATGGCGATGTTCCTGGTCCAGAAACACCAGGGATGCGTC 
CACGTGCTTCTCGCTTGAAGCGAGTTCTGGAGGATGAAATGACTTTTAAG 
GAGGATAAGGTTCCTGTATTGGACTCTAACAAAAGGCTGAAAATGCTCCA 
GGATCCGGTTTGTGGAGAGAAGAAAGAAGTAAACGAAGGAACCAAATTTG 
AATGGCTTGAGTCTTCTCGAATCAGGGATGCCAATAGAAGACGTCCTGAT 
GATCCCCTTTACGATAGAAAGACCTTACACATACCACCTGATGTTTTCAA 

Figure 11 
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GAAAA7G7C7GCA7CACAAAAGCAA7A7TGGAG7G77AAGAG7GAA7A7A 
TGGACATTGTGCTTTTCTTTAAAGTGGTTAGTAACTATTAATCTAGTGTT 
CAATCCATTTCCTCAATGTGATTTGTTCACTTACATCTGTTTACGTTATG 
CTCTTCTCAGGGGAAATTTTATGAGCTGTATGAGCTAGATGCGGAATTAG 
GTCACAAGGAGCTTGACTGGAAGATGACCATGAGTGGTGTGGGAAAATGC 
AGACAGGTAAATTAGTTGAAACAACTGGCCTGCTTGAATTATTGTGTCTA 
TAAATTTTGACACCACCTTTTGTTTCAGGTTGGTATCTCTGAAAGTGGGA 
TAGATGAGGCAGTGCAAAAGCTATTAGCTCGTGGGTAAGGGAACCATCAT 
ACTTTATGGAATTCGTTTACTGCTACTTCGGCTAGGATTTAAGAAATGGA 
AATCACTTCAAGCATCATTAGTTAGGATCCTGAGAACTCAGGATGTTTTC 
TTATTCGTTATATAATAAGTCTTTTCATCAAGGAGTAACAAACAAAACTT 
GCACAATATTTGTGTGCTCACTGGCAAGGCATATATACCCAGCTAACCTT 
TGCTAGTTCACTGTAGTAACAGTTACGGATAATATATGTTTACT7G7ATG 
TGGTACCCTCATTTTGTCTCTCATGGAGGCTTTGAAGCCTTGTGTTGAAA ■ 
CTGGATAGTTACATATGCTTCCAACAGAAACTAGCATGCAGATTCATATG 
CTTTCCTATTCTACTAATTATGTATTGACACACTCGTTGTTTCTTTTGAA 
AGA7A7AAAG77GGACGAA7CGAGCAGC7AGAAACA7CTGACCAAGCAAA 
AGCCAGAGGTGCTAATACTGTAAGTTTTCTTGGATAGGTCAAGGAGAGTG 
TTGCAGAC7G777T7GA7CA777C77777C7GTACA7TAC7T7CA7GCTG 
TAATTAACTCAATGGCTATTCTGGTCTGATTATCAGATAATTCCAAGGAA 
GCTAGT7CAGG7A7TAAC7CCA7CAACAGCAAGCGAGGGAAACA7CGGGC 
C7GA7GCCG7CCA7C77C7TGC7A7.AAAAGAGG777GTTA77TAC77A7T 
7A7C7TA7CA7G77CAG77CATCCAAG7CC7GAAAAA7TACAC7C77C77 
7ACCAA7C77CCA7CAAGC7G7G7AAAGGA777GGAA77AGAAAA7CA77 
A7TTGA7GC777G7777ATATGCAAGAGGT7CCC7TGAAAAGA7CTG77T 
AAGA77C7T7GCAC77GAAAAA77CAA7C77777AAG7GAA7CCCC7AC7 
77CT7ACAA7GA7CA7AG7C7GCAA7TGCA7G7CAAG7AA7A7CA77CC7 
7G77AC7GCA7CCCCC7C777CT7AA7GACCA77G7C7A7G77G7G777G 
7C7CGTGTGC7GGAGAAAATGA7AGC7GA7CCAAGC7G7ACA77A7CA7G 
A7TAAG7AGC7GC7CAGGAA77GCC777GG77ACA77GCC7AA7GG77TG 
ATGTCAA77777C77C7GAATCT77A7T77AGA7CAAAA7GGAGC7ACAA 
AAGTG7TCAACTG7G7ATGGA777GC7T77G77GAC7GTGC7GCC77GAG 
GTT7TGGGT7GGG7CCA7CAGCGATGATGCA7CA7G7GC7GC7C77GGAG 
CGTTAT7GA7GCAGG7AAGCAAG7G7ATTCTG7A7C77A7G7G7ACCA7G 
TGACTTCCTG7GCA7ATA7TTGGGT7GCAGGAAC7AA77C7GAA7CACCA 
TTTGGTATG77TTTTCCAGGTT7C7CCAAAGGAAG7G77A7A7GACAG7A 
AAGGTAAAC7GC7TG7A7CGCGAG77GTTT7G77AAACAGAA77TAAGGT 
AAATGACAC7GGTTAAT77AAAG7GCATACATGTTGAAATA77GCAGGGC 
7ATCAAGAGAAGCACAAAAGGC7C7AAGGAAA7A7ACGT7GACAGG7ACC 
ATTTCAGTAGGCAAGCTAAC7GACAA77TAACCGCTCACCGAATGATAGG 
TCTCTTAAACATTGCTAATGTAGATGATGTTTATGTTTCAATCTAATAGG 
GTCTACGGCGGTACAGT7feGCTCCAG7ACCACAAGTAATGGGGGATACAG 
ATGCTGCTGGAGTTAGAAA7ATAATAGAATCTAACGGATACTTTAAAGGT 
7CTTCTGAA7CATGGAAC7G7GC7GT7GATGGTC7AAATGAATG7GATGT 

Figure 11 (Continued) 
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TGCCCTTAGTGCTCTTGGAGAGCTAATTAATCATCTGTCTAGGCTAAAGG 
TGTGTTGGCTTGTTTAGTTTTTGCTTTTCACAAATTAAGCAAAGGAACTT 
TTCATAACTTACAGTTTCTATCTACTTGCAGCTAGAAGATGTACTTAAGC 
ATGGGGATATTTTTCCATACCAAGTTTACAGGGGTTGTCTCAGAATTGAT 
GGCCAGACGATGGTAAATCTTGAGATATTTAACAATAGC7GTGATGGTGG 
TCCTTCAGGCAAGTGCATATTTCTTTTTTGATAACTTCAACTAGAGGGCA 
GACATAGAAGGAAAAATTCTAATACTTCGTACGGATCTCCAGTAAGTAAT 
AGCCGATTTTTGTTTACCTATGTAGGGACCTTGTACAAATATCTTGATAA 
CTGTGTTAGTCCAACTGGTAAGCGACTCTTAAGGAATTGGATCTGCCATC 
CACTCAAAGATGTAGAAAGCATCAATAAACGGCTTGATGTAGTTGAAGAA 
TTCACGGCAAACTCAGAAAGTATGCAAATCACTGGCCAGTATCTCCACAA 
ACTTCCAGACTTAGAAAGACTGCTCGGACGCATCAAGTCTAGCGTTCGAT 
CATCAGCCTCTGTGTTGCCTGCTCTTCTGGGGAAAAAAGTGCTGAAACAA 
CGAGTAAGTATCAATCACAAGTTTTCTGAGTAATGCCTTCCATGAGTAGT 
ATAGGACTAAAACATTACGGGTCTAGCTAAAGACTGTTCTCCTTCT7TTG 
CAATG7CTGGTTATTCATTACATTTCTCTTAACTTATTGCATTGCAGGTT 
AAAGCATTTGGGCAAATTGTGAAAGGGTTCAGAAGTGGAATTGATCTGTT 
GTTGGCTCTACAGAAGGAATCAAATATGATGAGTTTGCTTTATAAACTCT 
G7AAAC77CC7ATA7TAG7AGGAAAAAGCGGGC7AGAG77A7TTC777CT 
CAA77CGAAGCAGCCA7AGA7AGCGAC777CCAAAT7A7CAGG7GCCCA7 
CTATCT7TCATAC77TACAACAAAATG7C7GTCACTACTCAAAGCAATGC 
A7ATGGC77AGA7CTCAAC7CACACCCCGAGGA7CC7AAAGGGA777GC7 
7777A77CC7AA7G77777GGA7GG777GA777A777C7AAC77GAAC77 
A77AA7C77G7ACCAGAACCAAGA7G7GACAGATGAAAACGC7GAAAC7C 
TCACAA7AC77A7CGAAC77777A7CGAAAGAGCAAC7CAATGG7C7GAG 
G7CA77CACACCA7AAGC7GCC7AGA7G7CC7GAGA7C7T77GCAA7CGC 
AGCAAG7C7C7C7GC7GGAAGCA7GGCCAGGCC7G77A7T777CCCGAA7 
CAGAAGCTACAGATCAGAA7CAGAAAACAAAAGGGCCAA7AC77AAAATC 
CAAGGAC7A7GGCA7CCA777GCAG77GCAGCCGA7GG7CAA77GCC7G7 
7CCGAA7GA7A7AC7CC77GGCGAGGC7AGAAGAAGCAG7GGCAGCA77C 
ATCC7CGGTCA77G77AC7GACGGGACCAAACA7GGGCGGAAAA7CAAC7 
C77C77CGTGCAACA7G7C7GGCCG77A7C777GCCCAAG777G7A7AC7 
CG7TAGATAA7TAC7C7A77C777GCAA7CAG77C77CAACA7GAA7AA7 
AAA77C7GTT77CTG7C7GCAGC77GGC7GCTACG7GCCG7G7GAG7C77 
GCGAAATCTCCC7CG7GGA7AC7A7C77CACAAGGC77GGCGCA7C7GAT 
AGAATCA7GACAGGAGAGAG7AAG7T77G7TC7CAAAA7ACCAA77CC7C 
GAAC7AT77ACTCAGA7777G7Q7GA77GGACAAGG7GG7T77GC77777 
7T7AGG7ACC7TTT7GG7AGAA7GCAC7GAGACAGCG7CAG77C77CAGA 
ATGCAAC7CAGGATTCAC7AG7AA7CC77GACGAAC7GGGCAGAGGAAC7 
AG7AC7TTCGA7GGA7ACGCCA77GCA7ACTCGGTAACC7GC7C77CTCC 
TTCAACTTATACTTG77GAXCAACAAAAACATGCAA77CA7777GC7GAA 
ACT7ATTGAT7TATA7CAGGT777TCGTCACC7GG7AGAGAAAG77CAA7 
GTCGGATGCTCTTTGCAACACA77ACCACCCTCTCACCAAGGAA7TCGCG 
7C7CACCCACGTGTCACC7CGAAACACA7GGC77GCGCA7TCAAA7CAAG 

Figure 11 (Continued) 
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ATCTG ATTATCAACC ACGTGGTTGTG ATCAAGACCT AGTGTTCTTGT ACC 6650 

GTTTAACCGAGGGAGCTTGTCCTGAGAGCTACGGACTTCAAGTGGCACTC 6700 

ATGGCTGGAATACCAAACCAAGTGG7TGAAACAGCATCAGGTGCTGCTCA 6750 

AGCCATGAAGAGATCAATTGGGGAAAACTTCAAGTCAAGTGAGCTAAGAT 6800 

CTGAGTTCTCAAGTCTGCATGAAGACTGGCTCAAGTCATTGGTGGGTATT 6850 

TCTCGAGTCGCCCACAACAATGCCCCCATTGGCGAAGATGACTACGACAC 6900 

TTTGTTTTGCTT ATGGCATGAGAT CAAAT CCTCTT ACTGTGTTCCCAAAT 6950 

AAATGGCTATGACATAACACTATCTGAAGCTCGTTAAGTCTTTTGCTTCT 700O 

CTGATGTTTATTCCTCTTAAAAAATGCTTATATATCAAAAAATTGTTTCC 7050 

TCGATTATAACAAGATTATATATGTATCTGTCGGTTTAGCTATGGTATAT 7100 

AATAT ATGTATGTTCATGAGATTGGTCAAGAGAAATACTCACAAACAGTA 7150 

T ATTAAG AAGGAAATATGTTTATGCATT AATTTAAGTTTCAAGAT AAACT 7200 

GC AAAT AAC C T CG ACT AAAG TT G C AAAG AC C AAAC AC AAAT T AC AAAACT 7250 

TATAAGACTTAAGTTCTGAATTCCCTAAAACCAAAAAAAAAAACAGAACA 7 300 

TATTTTGTTGCATCTACAAACAACACAAACCTACATAGTTTATAACTTAC 7350 

TCATCACTGAGATTAACATCAGAATCATTCTCCATTTCTTCATCTTCACT 7 400 

CTCATCATCATCACCACCACCATGATGATTCTCCTCCTCTTCACGTAACC 7 4 50 

TAGCAATCTCACTCTGAGCTCTATCAACAATCTGCTTCTTCTGCAACTCC 7 500 

AAATCTCTCTGAAAATCAGCTCTCATCTTCTCCAACTCCTTCATTTGCTC 7 550 

TTTCTTACTCTTCTCCATCTTCTCATAAACCTTCCCAAACCTCTCAACAG 7 600 

AATCCGCCAACATCTTATACGAAGCAGCGTCATTAACCTTCTTCCTCTCG 7650 

TACTCAACCTCATCATCCTCATCCTCCTCCTCTTCAGAATCACCAGGACT 7700 

ATCCATCATCTCATCAAACCCATTAGACTTATCTAAATAAACCTTAGTGT 7750 

TCATAAACAC AAACTCACCTGAATCAAC ACCACAAGCT AAACCTAAATCC 7800 

GACTTGGGCGAAACAC AAAGCAACAT ATCC AACTTATTGAAAAACG ACCA 7850 

TTTACTTGAACCTAAACCTGATTTC7CAACCTTAATCTTCTCTTTTCTAT 7900 

ACTTCCTCTTCAAGTCATCAATCATTCTCCTACATTGCGTCTCAGATTTC 7 950 

TCCATCCTTAGCTCCTCACTCACTTTCTCAGCTACTTCATTCCAATCCTC 8000 

GTTCCTCAAACTCCTTCT ACCCAATTGCAAAAACCTATCTCCCCAAACTT 8050 

CAAGCAACACAA 8062 

Figure 11 (Continued) 
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Figure 12 



Comments/References: AtMSH3 3' side antisense : AtMSH3 3 1 (13 = 2104bp) from 
PUC18/13 Sa!1/Sst1/T4 into pCW164 BamH1/T4 in Agrobacterium LBA4404 
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Figure 13 



Comments/References: AtMSH6 (S8) 3' side antisens : 62 Sal1/Sst1/T4 (1379bp) 
into pCWl64 BamH1/T4 
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8210 EcoR1.Sac1.Kpn1 



Figure 14 



Comments/References: AtMSH6 37AtMSH3 3' antisense : AtMSH6 (S8) 3' side (62=1379bp) 
Sal1/Sst1/T4 into pPF13 (pCWl64 AtMSH3 (S5) 3' side (13=2104) antisens)/Sma1. in 
LBA4404 



r 



WO 99/19492 



24 / 28 



PCT/EP98/06977 




8210 EcoRl.Sad. Kpn1 



Figure 15 



Comments/References: AtMSH3 3VAtMSH6 3* antisens (0) : AtMSH3 (S5) 3* side 
(13=2104bp) Sa!1/Sst1/T4 into pPFl4 (AtMSH6 (S8) 3'side (62=1379bp) antisense into 
pCVV164)/Sma1. in LBA4404 
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8210 EcoRl.Sad. Kpn1 



Figure 16 



Comments/References: AtMSH3 (S8) complete, sense orientation ; pPF26 (3342bp) 
Smat into pCW164 Smal 



WO 99/19492 



26 / 28 



PCT/EP98/06977 



H3. 




Figure 17 



Commentt/Ratorone«&: pPZP111 wilh codA Ecofll cassette in Ecoftl cito and Hlnd3 
GUS cassette in Hind3 $rte. KanR. Att genes under PromotorAcrminaior 35 S 
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p3243 



Figure 19 
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SEQUENCE LISTING 



Rhone- Poulenc Agro; Betzner, Andreas Stefan; Doutriaux, 
Marie- Pascale; Freyssinet, Georges; Perez, Pascual. 



<120> Methods for obtaining plant varieties 

<130> 395498C 

<150> P09745 

<151> 1997-10-10 

<160> 98 

<210> 1 

<211> 23 

<212> DNA 

<213> Artificial sequence 
<220> 

<221> modif ied_base 

<222> 11 

<223> , I 

<220> 

<221> modif ied_base 

<222> 14 

<223> I 

<220> 

<221> modif ied_base 

<222> 17 

<223> I 

<220> 

<223> Degenerate oligonucleotides UPMU used to isolate AtMSH3 and 
AtMSH6 . 

<300> 

<30l> Reenan and Kolodner 

<302> Genetics 

<303> 132 

<306> 963-973 

<307> 1992 ; 

<400> l 

ctggatccac nggnccnaay ^tg 23 

<210> 2 

<211> 23 

<212> DNA 



I 



WO 99/19492 PCT/EP98/06977 

2 ...... 

<213> Artificial sequence 

<220> 

< 2 2 1 > modi f ied_base 

<222> 15 
<223> I 

<220> 

<221> modifie d_ba s e 

<222> 18 

<223> I 

<220> 

<223> Degenerate oligonucleotides DOMU used to isolate AtMSH3 and 
AtMSH6 . 

<300> 

<30l> Reenan and Kolodner 

<302> Genetics 

<303> 132 

<306> 963-973 

<307> 1992 

<400> 2 

ctggatccrt artgngtnrc raa 23 



<210> 3 

<211> 24 

<212> DNA 

<213 > Artificial sequence 
<220> 

<223> MSH3 specific primer 636 for PCR using cDNA of Arabidopsis 
thaliana ecotype Columbia 

<400> 3 

tgctagtgcc tcttgcaagc teat 24 



<210> 4 

<211> 27 

<212> DNA ' 

<213> Artificial sequence 

<220> 

<223> Primer API for PCR using cDNA of Arabidopsis thaliana ecotype 

Columbia containing adapter sequences ligated to both its 
ends 



<400> 



4 
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ccaccctaat acgactcact atagggc 27 

<210> 5 
<211> 23 
<212> DNA 

<213> Artificial sequence 

<220> 

<223> Primer AP2 for PCR using cDNA of Arabidopsis thaliana ecotype 

Columbia containing adapter sequences ligated to both its 
ends 

<400> 5 

actcactata gggctcgagc ggc 23 

<210> 6 

<211> 30 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> MSH3 specific primer S525 for PCR using cDNA of Arabidopsis 
thaliana ecotype Columbia 

<400> 6 

aggttctgat catgtgtgac gctttactta 30 

<210> 7 

<211> 29 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> MSH3 specific primer S51 for PCR using cDNA of Arabidopsis 
thaliana ecotype Columbia 

<400> 7 

ggatcgggta ctgggttttg agt,gtgagg 29 



<210> 8 

<211> 24 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> MSH3 specific primer 635 for PCR using cDNA of Arabidopsis 
thaliana ecotype Columbia 



WO 99/19492 



PCT/EP98/06977 



4 

<400> 8 

gcacgtgctt gatggtgttt ccac 24 

<210> 9 

<211> 28 

<212> DNA 

<213> Artificial sequence ; 
<220> 

<223> MSH3 specific primer S523 for PCR using cDNA of Arabidopsis 
thaliana ecotype Columbia 

<400> 9 

tcagacagta tccagcatgg cagaagta 28 



<210> 10 

<211> 33 

<212> DMA 

<213> Artificial sequence 
<220> 

<223> MSH3 specific primer 1S5 for PCR using cDNA of Arabidopsis 
thaliana ecotype Columbia 

<400> 10 



atcccgggat gggcaagcaa aagcagcaga cga 3 3 



<210> 11 

<211> 27 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> MSH3 specific primer S53 for PCR using cDNA of Arabidopsis 
thaliana ecotype Columbia 

<400> li 

gacaaagagc gaaatgaggc cccttgg 27 



<210> 12 

<211> 1250 : 

<212> DNA 

<213> Arabidopsis thaliana ecotype Columbia 

<223> Clone 52 
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5 

<400> 12 

cccgggatgg gcaagcaaaa gcagcagacg atttctcgtt tcttcgctcc caaacccaaa 60 

tccccgactc acgaaccgaa tccggtagcc gaatcatcaa caccgccacc gaagatatcc 120 

gccactgtat ccttctctcc ttccaagcgc aagcctccct ccgaccacct cgccgccgcg 180 

tcacccaaaa agcctaaact ttctcctcac actcaaaacc cagtacccga tcccaattta 240 

caccaaagat ctctccagag atttctggaa ccctcgccgg aggaatatgt tcccgaaacg 300 

ccatcatcga ggaaatacac accaccggaa cagcaagtgg tggagctaaa gagcaagtac 360 

ccagatgtgg ttttgatggt ggaagttggt tacaggtaca gattcttcgg agaagacgcg 420 

gagatcgcag cacgcgtgtt gggtatttac gcccatatgg atcacaattt catgacggcg 480 

agtgtgccaa catttcgatt gaatctccac gcgagaagac tggtgaacgc aggatacaag 540 

accggtgcag cgaagcagac cgaaactgca gccattaagt cccatggtgc aaaccggacc 600 

ggccctcctt: rccggggacc gccggcgccg cacaccaaag ccacgcctga agcggctgag 660 

gatataagtg gtggtcgtgg tggtgaagaa ggttttggcc cacagagtaa tt tcttggtt 720 

tgtgttgtgg atgagagagt taagtcggag acactaggct gtggtattga aatgagtttt 780 

gatgttagag tcggtgttgt tggcgttgaa atttcgacag gtgaagttgt tcacgaagag 840 

ttcaatgata acctcatgag aagcggacta gaggctgcga ttttgagctt gtcaccagct 900 

gagctgttgc ttggccagcc tctttcacaa caaaccgaga agtttttggt ggcacatgct 960 

ggacctacct caaacgttcg agtggaacgt gcctcaccgg attgtttcag caatggtaat 1020 

gcagtagatg aggttatttc attatgcgaa aaaatcagcg caggtaactt agaagatgat 10 80 

aaagaaatga agctggaggc tgctgaaaaa ggaatgtctt gcttgacagt tcatacaatt 1140 

atgaacatgc cacatctgac tgttcaagcc ctcgccctaa cgctttgcca tcccaaacag 12 00 

tttggatttg aaaggatcct tcaccaaggg gcctcacctc gctctttgtc 12 50 



<210> 13 

<211> 34 

<212> DNA 

<213> Artificial, sequence 
<220> 

<223> MSH3 specific primer 2S5 for PCR using cDNA of Arabidopsis 
thaliana ecotype Columbia 
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6 

<400> 13 

atcccgggtc aaaatgaaca agttggtttt agtc 34 

<210> 14 

<211> 27 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> MSH3 specific primer S52 for PCR using cDNA of Arabidopsis 
chaliana ecotype Columbia 

<400> 14 

gccacatctg actgttcaag. ccctcgc 27 

<210> 15 

<211> 2110 

< 2 1 2 > DNA 

<213> Arabidcps is zha liana ecotype Columbia 

<223> Clone .13 

<400> 15 

gccacatctg actgttcaag ccctcgccct aacgttttgc catctcaaac agtttggatt 60 

tgaaaggatc ctttaccaag gggcctcatt tcgctctttg tcaagtaaca cagagatgac 120 

tctctcagcc aatactctgc aacagttgga ggttgtgaaa aataattcag atggatcgga 180 

atctggctcc ttattccata atatgaatca cacacttaca gtatatggtt ccaggcttct 240 

tagacactgg gtgactcatc ctctatgcga tagaaatttg atatctgctc ggcttgatgc 300 

tgtttctgag atttctgctt gcatgggatc tcatagttct tcccagctca gcagtgagtt 360 

ggttgaagaa ggttctgaga gagcaattgt atcacctgag ttttatctcg t.gctctcctc 420 

agtcttgaca gctatgtcta gatcatctga tattcaacgt ggaataacaa gaatctttca 480 

tcggactgct aaagccacag agttcattgc agttatggaa gctattttac ttgcggggaa 540 

gcaaattcag cggcttggca taaagcaaga ctctgaaatg aggagtatgc aatctgcaac 6 00 

tgtgcgatct actcttttga gaaaattgat ttctgttatt tcatcccctg ttgtggttga 660 

caatgccgga aaacttctct 9tgccctaaa taaggaagcg gctgttcgag gtgacttgct 72 0 

cgacatacta atcacttcca gcgaccaatt tcctgagctt gctgaagctc gccaagcagt 780 

tttagtcatc agggaaaagc tggattcctc gatagcttca tttcgcaaga agctcgctat 840 
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7 . . 

ccgaaacttg gaatttcttc aagtgtcggg gaccacacat ttgatagagc tgcccgttga 900 

ccccaaggcc cccatgaatt gggtgaaagt aaatagcacc aagaagacta ttcgatatca 960 

tcccccagaa atagtagctg gctcggacga gctagcccta gcaactgaac acctcgccac 1020 

tgtgaaccga gcttcgtggg atagtttcct caagagtttc agcagatacc acacagattt 1080 

taaggctgcc gtccaagctc ttgctgcact ggactgtttg cacccccttc caactctatc 1140 

tagaaacaag aactatgtcc gtcccgagtt tgtggatgac tgtgaaccag ttgagataaa 1200 

catacagtct ggtcgtcatc ctgtactgga gactatatta caagataact tcgtcccaaa 1260 

cgacacaatt tcgcatgcag aaggggaata ttgccaaact atcaccggac ctaacatggg 1320 

aggaaagagc tgctatatcc gtcaagntgc tttaatttcc ataatggctc aggttggttc 1380 

ctttgtacca gcgtcattcg ccaagccgca cgtgcttgat ggtgtcttca ctcggatggg 1440 

tgcttcagac agcacccagc atggcagaag tacctttcta gaagaattaa gtgaagcgcc 1500 

acacataatc agaacctgct cttctcgttc gcctgttata ttagatgagc ttggaagagg 1560 

caccagcaca cacgacggtg cagccaccgc ccacgcaaca tcacagcatc ccctagcaga 1620 

aaagagatgt ttggttctct tcgtcacgca ttaccccgaa acagccgaga ccagtaacgg 1680 

attcccaggt cccgctggga cataccacgt ctcgtatctg acattgcaga aggataaagg 1740 

cagttacgat catgatgatg tgacctacct atataagccc gcgcgtggtc tttgcagcag 1800 

gagctttggt tctaaggttg ctcagcrtgc ccagacacct ccatcatgta tacgtcgagc 1860 

catctcaacg gctgcaaaac tggaagctga ggtacgtgca agagagagaa atacacgcat 1920 

gggagaacca gaaggacatg aagaaccgag aggcgcagaa gaatctattt cggctctagg 1980 

tgacttgctt gcagacctga aatttgctct ctctgaagag gacccttgga aagcattcga 2040 

gtttttaaag catgcttgga agattgctgg caaaatcaga ctaaaaccaa cttgttcatt 2100 



ttgacccggg 2110 

<210> 16 

<211> 29 

<212> DNA 

<213> Artificial sequence 



<220> 

V 

<223> MSH3 specific primer S51 for PCR using cDNA of Arabidopsis 

thaliana ecotype Columbia 



<400> 



16 
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ggatcgggta ctgggttttg agtgtgagg 29 

<210> 17 

<211> 30 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> MSH3 specific primer S525 for PCR using cDNA of Arabidopsis 
thaliana ecotype Columbia 

<400> 17 

aggttctgat tatgtgtgac gctttactta 30 



<210> 18 

<211> 3522 

<212> DNA 

<213> Arabidopsis cfral iana ecotype Columbia 
<220> 

<221> CDS 

<222> (100) .... (3342) 

<223> AtMSH3 full-length cDNA and deduced sequence of the encoded 
polypeptide 

<400> 18 

cctaagaaag cgcgcgaaaa ttggcaaccc aagttcgcca tagccacgac cacgaccttc 60 

catttctctt aaacggagga gattacgaat aaagcaatt 99 

atg ggc aag caa aag cag cag acg att tct cgt ttc ttc get ccc aaa 147 
Met Gly Lys Gin Lys Gin Gin Thr lie Ser Arg Phe Phe Ala Pro Lys 
15 10 15 

ccc aaa tec ccg act cac gaa ccg aat ccg gta gec gaa tea tea aca 195 
Pro Lys Ser Pro Thr His Glu Pro Asn Pro Val Ala Glu Ser Ser Thr 
20 25 30 

ccg cca ccg aag ata tec gec act gta tec ttc tct cct tec aag cgt 243 
Pro Pro Pro Lys lie Ser Ala Thr Val Ser Phe Ser Pro Ser Lys Arg 
35 40 45 

aag ctt etc tec gac cac etc gec gee gcg tea ccc aaa aag cct aaa 2 91 

Lys Leu Leu Ser Asp His Leu Ala Ala Ala Ser Pro Lys Lys Pro Lys 
50 55 60 

ctt tct cct cac act caa aac cca gta ccc gat ccc aat tta cac caa 33 9 

Leu Ser Pro His Thr Gin Asn Pro Val Pro Asp Pro Asn Leu His Gin 
65 70 75 80 
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aga ttt etc 
Arg Phe Leu 



gaa acg tea 
Glu Thr Ser 



gag eta aag 
Glu Leu Lys 
115 

tac agg tac 
Tyr Arg Tyr 
130 

ttg ggt att 
Leu Gly lie 
145 

cca aca ttt 
Pro Thr Phe 



tac aag ate 
Tyr Lys lie 



cat ggt gca 
His Gly Ala 
195 

tat acc aaa 
Tyr Thr Lys 
210 

ggt ggt gaa 
Gly Gly Glu 
225 

gtg gat gag 
Val Asp Glu 



agt ttt gat 
Ser Phe Asp 



gaa gtt gtt 
Glu Val Val 
275 



cag aga ttt 
Gin Arg Phe 
85 

cca teg agg 
Ser Ser Arg 
100 

age' aag tac 
Ser Lys Tyr 



aga ttc ttc 
Arg Phe Phe 



tac get cat 
Tyr Ala His 
150 

cga ttg aat 
Arg Leu Asn 
165 

ggt gta gcg 
Gly Val Val 
180 

aac egg acc 
Asn Arg Thr 



gec acg ctt 
Ala Thr Leu 



gaa ggt ttt 
Glu Gly Phe 
230 

aga gtt aag 
Arg Val Lys 
245 

gtt aga gtc 
Val Arg Val 
260 

tat gaa gag 
Tyr Glu Glu 



ctg gaa ccc 
Leu Glu Pro 



aaa cac aca 
Lys Tyr Thr 
105 

cca gat gtg 
Pro Asp Val 
120 

gga gaa gac 
Gly Glu Asp 
135 

atg gat cac 
Met Asp His 



ttc cac gcg 
Phe His Val 



aag cag act 
Lys Gin Thr 
185 

ggc cct ttt 
Gly Pro Phe 
200 

gaa gcg get 
Glu Ala Ala 
215 

ggt tea cag 
Gly Ser Gin 



teg gag aca 
Ser Glu Thr 



ggt: gtt gtt 
Gly Val Val 
265 

ttc aat gat 
.Phe Asn Asp 
280 



9 

teg ccg gag 

Ser Pro Glu 

* 90 

cca ttg gaa 

Pro Leu Glu 



gtt ttg atg 
Val Leu Met 



gcg gag ate 
Ala Glu lie 
140 

aat ttc atg 
Asn Phe Met 

155 

aga aga ctg 
Arg Arg Leu 
170 

gaa act gca 
Glu Thr Ala 



ttc egg gga 
Phe Arg Gly 



gag gat ata 
Glu Asp lie 
220 

agt aat ttc 
Ser Asn Phe 
235 

tta ggc tgt 
Leu Gly Cys 
250 

ggc gtt gaa 
Gly Val Glu 



aat ttc atg 
Asn Phe Met 



gaa tat gtt 
Glu Tyr Val 
95 

cag caa gtg 
Gin Gin Val 
110 

gtg gaa gtt 
Val Glu Val 
125 

gca gca cgc 
Ala Ala Arg 



acg gcg agt 
Thr Ala Ser 



gtg aat gca 
Val Asn Ala 
175 

gee att aag 
Ala lie Lys 
190 

ctg teg gcg 
Leu Ser Ala 
205 

agt ggt ggt 
Ser Gly Gly 



ttg gtt tgt 
Leu Val Cys 



ggt att gaa 
Gly He Glu 
255 

att teg aca 
He Ser Thr 
270 

aga agt gga 
Arg Ser Gly 
285 



ccc 387 
Pro 



gtg 435 
val 



ggt 483 
Gly 



gtg 531 
Val 



gtg 579 

Val 

160 ' 

gga 627 

Gly. 



tec 675 
Ser 



ttg 723 

Leu 



tgt 771 

Cys 



gtt 819 

Val 

240 

atg 867 
Met 



ggt 915 
Gly 



tta 963 
Leu 
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gag get gtg act ctg age tug tea cca get gag ctg ttg ctt ggc cag 1011 
Glu Ala Val lie Leu Ser Leu Ser Pro Ala Glu Leu Leu Leu Gly Gin 
290 295 * 300 

cc: etc tea caa caa act gag aag ttt ttg gtg gca cat get gga cct 1059 
Pro Leu Ser Gin Gin Thr Glu Lys Phe Leu Val Ala Met Ala Gly Pro 
305 310 315 320 

ace tea aac gtt cga gtg gaa cgt gec tea ctg gat tgt ttc- age aat 1107 
Thr Ser Asn Val Arg Val Glu Arg Ala Ser Leu Asp Cys Phe Ser Asn 
325 330 335 

ggt aat gca gta gat gag gtt att tea tta tgt gaa aaa ate age gca 1155 
Gly Asn Ala Val Asp Glu Val lie Ser Leu Cys Glu Lys lie Ser Ala 
340 345 350 

ggt aac tta gaa gat gat aaa gaa atg aag ctg gag get get gaa aaa 1203 
Gly Asn Leu Glu Asp Asp Lys Glu Met Lys Leu Glu Ala Ala Glu Lys 
355 360 365 

gga atg tct tgc ttg aca gtt cat aca att atg aac atg cca cat ctg 1251 
Gly Met Ser Cys Leu Thr Val His Thr lie Met Asn Met Pro His Leu 
370 375 360 

act gtt caa gee etc gee eta acg ttt tgc cat etc aaa cag ttt gga 1299 
Thr Val Gin Ala Leu Ala Leu Thr Phe Cys His Leu Lys Gin Phe Gly 
385 390 395 400 

ttt gaa agg ate ctt tac caa ggg gee tea ttt cgc tct ttg tea agt 1347 
Phe Glu Arg lie Leu Tyr Gin Gly Ala Ser Phe Arg Ser Leu Ser Ser 
405 410 415 

aac aca gag atg act etc tea gee aat act ctg caa cag ttg gag gtt 1395 
Asn Thr Glu Met Thr Leu Ser Ala Asn Thr Leu Gin Gin Leu Glu Val 
420 425 430 

gtg aaa aat aat tea gat gga teg gaa tct ggc tec tta ttc cat aat 144 3 

Val Lys Asn Asn Ser Asp Gly Ser Glu Ser Gly Ser Leu Phe His Asn 
435 440 445 

atg aat cac aca ctt aca gta tat get tec agg ctt ctt aga cac tgg 1491 
Met Asn His Thr Leu Thr Val Tyr Gly Ser Arg Leu Leu Arg His Trp 
450 455 460 

gtg act cat cct eta tgc gat aga aat ttg ata tct get egg ctt gat 153 9 

Val Thr His Pro Leu Cys Asp Arg Asn Leu lie Ser Ala Arg Leu Asp 
465 470 475 480 

get gtt tct gag att tct get tgc atg gga tct cat agt tct tec cag 1587 
Ala Val Ser Glu lie Ser Ala Cys Met Gly Ser His Ser Ser Ser Gin 
485 490 495 
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etc age age 
Leu Ser Ser 



cct gag ttt 
Pro Glu Phe 
515 

tea tec gat 
Ser Ser Asp 
530 

aaa gee aca 
Lys Ala Thr 
545 

aag caa att 
Lys Gin lie 



acg caa net 
Met Gin Ser 



get act tea 
Val He Ser 
595 

gec eta aat 
Ala Leu Asn 
610 

ate act tec 
He Thr Ser 
625 

gtt tta gtc 
Val Leu Val 



aag aag etc 
Lys Lys Leu 



aca cat ttg 
Thr His Leu 
675 

gtg aaa gta 
Val Lys Val 
690 



gag ttg gtt 
Glu Leu Val 
500 

tat etc gtg 
Tyr Leu Val 



att caa cgt 
He Gin Arg 



gag ttc att 
Glu Phe He 
550 

cag egg ctt 
Gin Arg Leu 
565 

gca act gtg 
Ala Thr Val 
580 

tec cct gtt 
Ser Pro Val 



aag gaa gcg 
Lys Glu Ala 



age gac caa 
Ser Asp Gin 
630 

ate agg gaa 
He Arg Glu 
645 

get att cga 
Ala lie Arg 
660 

ata gag ctg 
He Glu Leu 



aat age acq 
Asn Ser Thr 



gaa gaa ggt 
Glu Glu Gly 
SOS 

etc tec tea 
Leu Ser Ser 
520 

gga ata aca 
Gly He Thr 
535 

gca gtt atg 
Ala Val Met 



ggc ata aag 
Gly He Lys 



cga tct act 
Arg Ser Thr 
585 

gtg gtt gac 
Val Val Asp 
600 

get gtt cga 
Ala Val Arg 
615 

ttt cct gag 
Phe Pro Glu 



aag ctg gat 
Lys Leu Asp 



aat ttg gaa 
Asn Leu Glu 
665 

ccc gtt gat 
Pro Val Asp 
680 

aag aag act 
Lys Lys Thr 
695 



1 1 

tct gag aga 
Ser Glu Arg 



gtc ttg aca 
Val Leu Thr 



aga ate ttt 
Arg lie Phe 
540 

gaa get att 
Glu Ala He 
555 

caa gac tct 
Gin Asp Ser 
570 

ctt ttg aga 
Leu Leu Arg 



aat gee gga 
Asn Ala Gly 



ggt gac ttg 
Gly Asp Leu 
620 

ctt get gaa 
Leu Ala Glu 
635 

tec teg ata 
Ser Ser He 
650 

ttt ctt caa 
Phe Leu Gin 



tec aag gtc 
Ser Lys Val 



att cga tat 
He Arg Tyr 
700 



gca att gta 
Ala He Val 
510 

get atg tct 
Ala Met Ser 
525 

cat egg act 
His Arg Thr 



tta ctt gcg 
Leu Leu Ala 



gaa atg agg 
Glu Met Arg 
575 

aaa ttg att 
Lys Leu lie 
590 

aaa ctt etc 
Lys Leu Leu 
605 

etc gac ata 
Leu Asp He 



get cgc caa 
Ala Arg Gin 



get tea ttt 
Ala Ser Phe 
655 

gtg teg ggg 
Val Ser Gly 
670 

cct atg aat 
Pro His Asn 
685 

cat ccc cca 
His Pro Pro 



tea 1635 
Ser 



aga 1683 
Arg 



get 1731 
Ala 



ggg 1779 

Gly 
560 

agt 1827 

Ser 



tct 1875 
Ser 



tct 1923 
Ser 



eta 1971 
Leu 



gca 2019 

Ala 

640 

cgc 2067 
Arg 



ate 2115 
He 



tgg 216 3 

Trp 



gaa 2211 
Glu 
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aca gca get ggc teg gat gag eta get eta gca act gaa cat etc gee 
He Val Ala Gly Leu Asp Glu Leu Ala Leu Ala Thr Glu His Leu Ala 
705 710 * 715 720 



2259 



att gtg aac cga get teg tgg gat agt tec etc aag agt ttc agt aga 2307 
He Val Asn Arg Ala Ser Trp Asp Ser Phe Leu Lys Ser Phe Ser Arg 
725 730 735 



tac tac aca gat ttt aag get gec gtt caa get ctt get gca ctg gac 2355 
Tyr Tyr Thr Asp Phe Lys Ala Ala Val Gin Ala Leu Ala Ala Leu Asp 
740 745 750 

tgt ttg cac tec ctt tea act eta tct aga aac aag aac tat gtc cgt 2403 
Cys Leu His Ser Leu Ser Thr Leu Ser Arg Asn Lys Asn Tyr Val Arg 
755 760 765 



ccc gag ttt gtg gat gac tgt gaa cca gtt gag ata aac ata cag tct 2451 
Pro Glu Phe Val Asp Asp Cys Glu Pro Val Glu lie Asn lie Gin Ser 
770 775 780 



ggt cgt cat cct gta ctg gag act ata tea caa gat aac ttc gtc cca 
Gly Arg His Pro Val Leu Glu Thr He Leu Gin Asp Asn Phe Val Pro 
785 790 795 800 



2499 



aat gac aca att ttg cat gca gaa ggg gaa tat tgc caa att ate acc 2547 
Asn Asp Thr He Leu His Ala Glu Gly Glu Tyr Cys Gin He lie Thr 
805 810 815 



gga cct aac atg gga gga aag age tgc tat ate cgt caa gtt get tta 2595 
Gly Pro Asn Met Gly Gly Lys Ser Cys Tyr He Arg Gin Val Ala Leu 
820 825 830 



att tec ata atg get cag gtt ggt tec ttt gta cca gcg tea ttc gee 
He Ser He Met Ala Gin Val Gly Ser Phe Val Pro Ala Ser Phe Ala 
835 840 845 



2643 



aag ctg cac gtg ctt gat ggt gtt ttc act egg atg ggt get tea gac 2691 

Lys Leu His Val Leu Asp Gly Val Phe Thr Arg Met Gly Ala Ser Asp 

850 855 860 

agt ate cag cat ggc aga agt acc ttt eta gaa gaa tta agt gaa gcg 273 9 

Ser He Gin His Gly Arg Ser Thr Phe Leu Glu Glu Leu Ser Glu Ala 
865 870 875 880 

tea cac ata ate aga acc tgt tct tct cgt teg ctt gtt ata tta gat 2787 

Ser His lie He Arg Thr Cys Ser Ser Arg Ser Leu Val He Leu Asp 
885 890 895 



gag ctt gga aga ggc actj ( age aca cac gac ggt gta gee att gec tat 283 5 
Glu Leu Gly Arg Gly Thr Ser Thr His Asp Gly Val Ala He Ala Tyr 
900 905 910 
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gca aca eta cag cat etc eta gca gaa aag aga tgt ttg gtt ctt ttt 2883 

Ala Thr Leu Gin His Leu Leu Ala Glu Lys Arg Cys Leu Val Leu Phe 

915 920 * 925 

gtc acg cat tac cct gaa ata get gag ate agt aac gga ttc cca ggt 2931 
Val Thr His Tyr Pro Glu lie Ala Glu lie Ser Asn Gly Phe Pro Gly 
930 935 940 

tct gtt ggg aca tac cat gtc teg tat ctg aca ttg cag aag gat aaa 2979 
Ser Val Gly Thr Tyr His Val Ser Tyr Leu Thr Leu Gin Lys Asp Lys ; 
945 950 955 960 

ggc agt tat gat cat gat gat gtg acc tac eta tat aag ctt gtg cgt 3027 
Gly Ser Tyr Asp His Asp Asp Val Thr Tyr Leu Tyr Lys Leu Val Arg 
965 970 975 

ggt ctt tgc age agg age ttt ggt ttt aag gtt get cag ctt gec cag 3075 
Gly Leu Cys Ser Arg Ser Phe Gly Phe Lys Val Ala Gin Leu Ala Gin 
980 985 990 

ata cct cca tea tgt ata cgt cga gee act tea acg get gca aaa ttg 3123 
lie Pro Pro Ser Cys lie Arg Arg Ala lie Ser Met Ala Ala Lys Leu 
995 1000 1005 

gaa get gag gta cgt gca aga gag aga aa: aca cgc atg gga gaa cca 3171 
Glu Ala Glu Val Arg Ala Arg Glu Arg Asn Thr Arg Met Gly Glu Pro 
1010 1015 • 1020 

gaa gga cat gaa gaa ccg aga ggc gca gaa gaa tct act teg get eta 3219 
Glu Gly His Glu Glu Pro Arg Gly Ala Glu Glu Ser lie Ser Ala Leu 
1025 1030 1035 1040 

ggt gac ttg ttt gca gac ctg aaa ttt get etc tct gaa gag gac cct 3267 
Gly Asp Leu Phe Ala Asp Leu Lys Phe Ala Leu Ser Glu Glu Asp Pro 
1045 1050 1055 

tgg aaa gca ttc gag ttt tta aag cat get tgg aag att get ggc aaa 3315 
Trp Lys Ala Phe Glu Phe Leu Lys His Ala Trp Lys He Ala Gly Lys 
1060 1065 1070 

ate aga eta aaa cca act tgt tea ttt tgatttaatc ttaacattat 3362 
He Arg Leu Lys Pro Thr Cys Ser Phe 
1075 1080 

ageaactgea aggtcttgat catctgttag ttgegtacta acttatgtgt attagtataa 3422 

caagaaaaga gaattagaga gatggattct aatccggtgt tgcagtacat cttttctcca 3482 

cccgcataaa aaaaaaaaaa a,aaaaaaaaa aaaaaaaaaa 3522 



<210> 19 
<211> 1081 
<212> PRT 
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<213> Arahidopsis chaliana ecotype Columbia 

<223> Polypeptide MSH3 

<400> 19 

Met Gly Lys Gin Lys Gin Gin Thr He Ser Arg Phe Phe Ala Pro Lys 
15 10 15 

Pro Lys Ser Pro Thr His Glu Pro Asn Pro Val Ala Glu Ser Ser Thr 
20 25 30 

Pro Pro Pro Lys He Ser Ala Thr Val Ser Phe Ser Pro Ser Lys Arg 
35 40 45 

Lys Leu Leu Ser Asp His Leu Ala Ala Ala Ser Pro Lys Lys Pro Lys 
50 55 60 

Leu Ser Pro His Thr Gin Asn Pro Val Pro Asp Pro Asn Leu His Gin 
65 70 75 80 

Arg Phe Leu Gin Arg Phe Leu Glu Pro Ser Pro Glu Glu Tyr Val Pro 

85 90 95 

Glu Thr Ser Ser Ser Arg Lys Tyr Thr Pro Leu Glu Gin Gin Val Val 
100 105 110 

Glu Leu Lys Ser Lys Tyr Pro Asp Val Val Leu Met Val Glu Val Gly 
115 120 125 

Tyr Arg Tyr Arg Phe Phe Gly Glu Asp Ala Glu He Ala Ala Arg Val 
130 135 140 

Leu Gly lie Tyr Ala His Met Asp His Asn Phe Mec Thr Ala Ser Val 
145 150 155 160 

Pro Thr Phe Arg Leu Asn Phe His Val Arg Arg Leu Val Asn Ala Gly 
165 170 175 

Tyr Lys He Gly Val Val Lys Gin Thr Glu Thr Ala Ala He Lys Ser 
180 185 190 

His Gly Ala Asn Arg Thr Gly Pro Phe Phe Arg Gly Leu Ser Ala Leu 
195 200 205 

Tyr Thr Lys Ala Thr Leu Glu Ala Ala Glu Asp He Ser Gly Gly Cys 
210 215 220 

Gly Gly Glu Glu Gly Phe Gly Ser Gin Ser Asn Phe Leu Val Cys Val 
225 23Q f 235 240 

Val Asp Glu Arg Val Lys Ser Glu Thr Leu Gly Cys Gly He Glu Met 
245 250 255 
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Ser Phe Asp Val 
260 

Glu Val Val Tyr 
275 

Glu Ala Val lie 
290 

Pro Leu Ser Gin 
305 

Thr Ser Asn Val 



Gly Asn Ala Val 
340 

Gly Asn Leu Glu 
355 

Gly Met Ser Cys 
370 

Thr Val Gin Ala 
385 

Phe Glu Arg lie 



Asn Thr Glu Met 
420 

Val Lys Asn Asn 
435 

Met Asn His Thr 
450 

Val Thr His Pro 
465 

Ala Val Ser Glu 



Leu Ser Ser Glu 
500 

Pro Glu Phe Tyr 
515 

Ser Ser Asp lie 
530 



Arg Val Gly Val 



Glu Glu Phe Asn 
280 

Leu Ser Leu Ser 
295 

Gin Thr Glu Lys 
310 

Arg Val Glu Arg 
325 

Asp Glu Val lie 



Asp Asp Lys Glu 
360 

Leu Thr Val His 
375 

Leu Ala Leu Thr 
390 

Leu Tyr Gin Gly 
405 

Thr Leu Ser Ala 



Ser Asp Gly Ser 
440 

Leu Thr Val Tyr 
455 

Leu Cys Asp Arg 
470 

lie Ser Ala Cys 



Leu Val Glu Glu 



Leu VaOL Leu Ser 
520 

Gin Arg Gly lie 
535 
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Val Gly Val Glu 
265 

Asp Asn Phe Met 



Pro Ala Glu Leu 
300 

Phe Leu Val Ala 
315 

Ala Ser Leu Asp 
330 

Ser Leu Cys Glu 
345 

Met Lys Leu Glu 



Thr lie Met Asn 
380 

Phe Cys His Leu 
395 

Ala Ser Phe Arg 
410 

Asn Thr Leu Gin 
425 

Glu Ser Gly Ser 



Gly Ser Arg Leu 
460 

Asn Leu lie Ser 
475 

Met Gly Ser His 
490 

Gly Ser Glu Arg 
505 

Ser Val Leu Thr 



Thr Arg lie Phe 
540 



lie Ser Thr Gly 
270 

Arg Ser Gly Leu 
285 

Leu Leu Gly Gin 



Met Ala Gly Pro 
320 

Cys Phe Ser Asn 
335 

Lys lie Ser Ala 
350 

Ala Ala Glu Lys 
365 

Met Pro His Leu 



Lys Gin Phe Gly 
400 

Ser Leu Ser Ser 
415 

Gin Leu Glu Val 
430 

Leu Phe His Asn 
445 

Leu Arg His Trp 



Ala Arg Leu Asp 
480 

Ser Ser Ser Gin 
495 

Ala He Val Ser 
510 

Ala Met Ser Arg 
525 

His Arg Thr Ala 
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Lys Ala Thr Glu Phe He Ala Val Met Glu Ala He Leu Leu Ala Gly 
545 550 555 560 

Lys Gin He Gin Arg Leu Gly lie Lys Gin Asp Ser Glu Met Arg Ser 
565 570 575 

Met Gin Ser Ala Thr Val Arg Ser Thr Leu Leu Arg Lys Leu He Ser 
580 585 590 

Val He Ser Ser Pro Val Val Val Asp Asn Ala Gly Lys Leu Leu Ser 
595 600 605 

Ala Leu Asn Lys Glu Ala Ala Val Arg Gly Asp Leu Leu Asp He Leu 
610 615 620 

He Thr Ser Ser Asp Gin Phe Pro Glu Leu Ala Glu Ala Arg Gin Ala 
625 630 635 640 

Val Leu Val lie Arg Glu Lys Leu Asp Ser Ser He Ala Ser Phe Arg 
645 650 655 

Lys Lys Leu Ala lie Arg Asn Leu Glu Phe Leu Gin Val Ser Gly He 
660 665 670 

Thr Kis Leu He Glu Leu Pro Val Asp Ser Lys Val Pro His Asn Trp 
675 680 685 

Val Lys Val Asn Ser Thr Lys Lys Thr lie Arg Tyr His Pro Pro Glu 
690 695 700 

lie Val Ala Gly Leu Asp Glu Leu Ala Leu Ala Thr Glu His Leu Ala 
705 710 715 720 

lie Val Asn Arg Ala Ser Trp Asp Ser Phe Leu Lys Ser Phe Ser Arg 
725 730 735 

Tyr Tyr Thr Asp Phe Lys Ala Ala Val Gin Ala Leu Ala Ala Leu Asp 
740 745 750 

Cys Leu His Ser Leu Ser Thr Leu Ser Arg Asn Lys Asn Tyr Val Arg 
755 760 765 

Pro Glu Phe Val Asp Asp Cys Glu Pro Val Glu He Asn He Gin Ser 
770 775 780 

Gly Arg His Pro Val Leu Glu Thr He Leu Gin Asp Asn Phe Val Pro 
785 790 795 800 

Asn Asp Thr He Leu His Ala Glu Gly Glu Tyr Cys Gin He lie Thr 
805 810 815 



Gly Pro Asn Met Gly Gly Lys Ser Cys Tyr He Arg Gin Val Ala Leu 
820 825 830 
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lie Ser lie Met Ala Gin Val Gly Ser Phe Val Pro Ala Ser Phe Ala 
835 840 845 

Lys Leu His Val Leu Asp Gly Val Phe Thr Arg Met Gly Ala Ser Asp 
850 855 860 

Ser lie Gin His Gly Arg Ser Thr Phe Leu Glu Glu Leu Ser Glu Ala 
865 870 875 880 

Ser His lie lie Arg Thr Cys Ser Ser Arg Ser Leu Val lie Leu Asp 
885 890 895 

Glu Leu Gly Arg Gly Thr Ser Thr His Asp Gly Val Ala lie Ala Tyr 
900 905 910 

Ala Thr Leu Gin His Leu Leu Ala Glu Lys Arg Cys Leu Val Leu Phe 
915 920 925 

Val Thr His Tyr Pro Glu lie Ala Glu lie Ser Asn Gly Phe Pro Gly 
930 935 940 

Ser Val Gly Thr Tyr His Val Ser Tyr Leu Thr Leu Gin Lys Asp Lys 
945 950 955 960 

Gly Ser Tyr Asp His Asp Asp Val Thr Tyr Leu Tyr Lys Leu Val Arg 
965 970 975 

Gly Leu Cys Ser Arg Ser Phe Gly Phe Lys Val Ala Gin Leu Ala Gin 
980 985 990 

lie Pro Pro Ser Cys lie Arg Arg Ala lie Ser Met Ala Ala Lys Leu 
995 1000 1005 

Glu Ala Glu Val Arg Ala Arg Glu Arg Asn Thr Arg Met Gly Glu Pro 
1010 1015 1020 

Glu Gly His Glu Glu Pro Arg Gly Ala Glu Glu Ser lie Ser Ala Leu 
1025 1030 1035 1040 

Gly Asp Leu Phe Ala Asp Leu Lys Phe Ala Leu Ser Glu Glu Asp Pro 
1045 1050 1055 

Trp Lys Ala Phe Glu Phe Leu Lys His Ala Trp Lys lie Ala Gly Lys 
1060 1065 1070 

lie Arg Leu Lys Pro Thr Cys Ser Phe 
1075 1080 



<210> 20 

<211> 24 

<212> DNA 

<213> Artificial sequence 
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<220> 

<223> MSH6 specific primer 638 for PCR using cDNA of Arabidopsis 

thaliana ecotype Columbia 

<400> 20 

tctctaccag gtgacgaaaa accg * 24 



<210> 21 . 

<211> 28 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer S81 for PCR using cDNA of Arabidopsis thaliana ecotype 
Columbia 

<400> 21 



cgtcgccttt agcatcccct tccttcac 28 



<210> 22 

<211> 30 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> MSH6 specific primer S823 for PCR using cDNA of Arabidopsis' 
thaliana ecotype Columbia 

<400> 22 



gcttggcgca tctaatagaa tcatgacagg 3 0 



<210> 23 

<211> 24 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> MSH6 specific primer 637 for PCR using cDNA of Arabidopsis 
thaliana ecotype Columbia 

<400> 23 

gacagcgtca gttcttcaga atgc 24 



<210> 24 
<211> 33 
<212> DNA 
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<213> Artificial sequence 

<22Q> 

<223> MSH6 specific primer 1S3 for PCR using cDNA of Arabidopsis 

thaliana ecotype Columbia 

<400> ' 24 

atcccgggat gcagcgccag agatcgattt tgt 33 

<210> 25 

<211> 27 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> MSH6 specific primer S83 for PCR using cDNA of Arabidopsis 
thaliana ecotype Columbia 

<40C> 25 



cgc:a:cca: 


ggccgcc teg 


aaccgag 








27 


<21C> 


26 












<211> 


1385 












<212> 


DNA 












<213> 


Arabidopsis thai iana 


ecotype Columbia 






<223> 


Clone 43 












<40C> 


26 












cccgggatgc 


agegecagag 


atcgattttg 


tctttcttcc 


aaaaacccac 


ggeggegact 


60 


acgaagggtt 


tggtttccgg 


egatgetget 


ageggegggg 


geggcagegg 


aggaccacga 


120 


tttaatgtga 


aggaagggga 


tgetaaagge 


gaegcttctg 


tacgttttgc 


tgtttcgaaa 


180 


tctgtcgatg 


aggttagagg 


aacggatact 


ccaccggaga 


aggttccgcg 


tcgtgtcctg 


240 


ccgtctggat 


ttaagcegge 


tgaatccgcc 


ggtgatgctt 


cgtccctgtt 


ctccaatatt 


300 


atgcataagt 


ttgtaaaagt 


cgatgatcga 


gattgttctg 


gagagaggag 


ccgagaagat 


360 


gttgttccgc 


tgaatgattc 


atctctatgt 


atgaaggcta 


atgatgttat 


tcctcaattt 


420 


cgttccaata 


atggtaaaac 


tcaagaaaga 


aaccatgett 


ttagtttcag 


tgggagagct 


480 


gaacttagat 


cagtagaaga 


tataggagta 


gatggcgatg 


ttcctggtcc 


agaaacacca 


540 


gggatgcgtc 


cacgtgcttc 


tegcttgaag 


cgagttctgg 


aggatgaaat 


gacttttaag 


600 


gaggataagg 


ttcctgtatt 


ggactccaac 


aaaaggctga 


aaatgetcca 


ggatccggtt 


660 
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tgtggagaga agaaagaagt aaacgaagga accaaacttg aatggcttga gtcttctcga 720 

atcagggatg ccaatagaag acgtcctgat gacccccttt acgacagaaa gaccttacac 730 

ataccacctg atgttrttcaa gaaaatgtcc gcatcacaaa agcaatattg gagtgttaag 840 

agtgaatata tggacattgt gcttttcttt aaagtgggga aattttatga gctgtatgag 900 

ctagacgcgg aattaggtca caaggagctt gactggaaga tgaccatgag tggtgtggga 960 

aaatgcagac aggttggtat ctctgaaagt gggacagatg aggcagtgca aaagctatta 1020 

gctcgtggat ataaagtcgg acgaatcgag cagctagaaa catctgacca agcaaaagcc 1080 

agaggtgcta atactataat tccaaggaag ctagttcagg tattaactcc atcaacagca 1140 

agcgagggaa acatcgggcc tgatgccgtc canctccttg ctataaaaga gatcaaaatg 1200 

gagctacaaa agtgttcaac egcgcatgga c ttgcctttg ttgactgtgc tgccttgagg 1260 

t:ttcggg"g ggtccatcag cgacga:gca ccatgtgctg cccttggagc gttattgaug 1320 

caggzzzczc caaaggaagc g:ra:angac agtaaagggc tatcaagaga agcacaaaag 1330 

gctccaagga aatatacgtc gacagggccc acggcggcac agttggcccc agtaccacaa 1440 

gtaatggggg acacagatgc tgctggagct: agaaatataa tagaatctaa cggatacttt 1500 

aaaggctccc ctgaatcatg gaactgtgct gttgatggcc taaatgaatg tgatgttgcc 1550 

cccagngctc ttggagagct aattaaccat ctgcctaggc taaagctaga agatgtactt 1620 

aagcacgggg atatttttcc ataccaagcc tacaggggtt gcctcagaat cgatggccag 1680 

acgacggcaa atcttgagat atttaacaat agctgtgatg gtggtccttc agggaccttg 1740 

tacaaatatc ttgataactg tgttagtcca actggtaagc gactcttaag gaattggatc 1800 

tgccacccac tcaaagatgt agaaagcatc aataaacggc ttgatgtagt tgaagaattc 1860 

acggcaaact cagaaagtat gcaaatcacc ggccagcatc tccacaaact tccagactta 1920 

gaaagactgc tcggacgcat caacjtctagc gttcgatcat cagcctctgt gttgcccgct 1980 

cttctgggga aaaaagtgct gaaacaacga gttaaagcat ttgggcaaat tgtgaaaggg 2 040 

ttcagaagtg gaattgatct gttgttggct ctacagaagg aatcaaatat gatgagtttg 2100 

ctttataaac tctgtaaact fccctatatta gtaggaaaaa gcgggccaga gttacttctt 2160 

tctcaattcg aagcagccat agatagcg 2188 
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<210> 27 
<211> 1385 
<212> DNA 

<213> Arabidopsis chaliana ecotype Columbia 

<223> Clone 62 

<400> 27 

catcagcctc tgtgttgcct gctcttctgg ggaaaaaagt gccgaaacaa cgagttaaag 60 

catttgggca aattgtgaaa gggttcagaa gtggaattga tctgttgttg gctctacaga 120 

aggaatcaaa tatgatgagt ttgctctata aactctgtaa acttcctata ttagtaggaa 180 

aaagcgggct agagtcattt ctttctcaat tcgaagcagc catagatagc gactttccaa 240 

attatcagaa ccaagatgtg acagatgaaa acgctgaaac tctcacaata cttatcgaac 3 00 

tttttatcga aagagcaact caatggtctg aggtcattca caccataagc tgcctagatg 360 

tcctgagatc ttttgcaatc gcagcaagtc tctctgctgg aagcatggcc aggcctgtca 420 

tttctcccga atcagaagct acagaccaga accagaaaac aaaagggcca atacttaaaa 480 

tccaaggact atggcatcca tttgcagttg cagccgatgg tcaattgcct gttccgaatg 540 

atatactcct tggcgaggct agaagaagca gtggcagcat tcatcctcgg tcattgttac 600 

tgacgggacc aaacatgggc ggaaaatcaa ctcttcttcg tgcaacatgt ctggccgtta 660 

tctttgccca acttggctgc tacgtgccgt gtgagtcttg cgaaatctcc ctcgtggata 720 

ctatcttcac aaggcttggc gcatctgata gaatcatgac aggagagagt acctttttgg 780 

tagaatgcac tgagacagcg tcagttcttc agaatgcaac tcaggattca ctagtaatcc 840 

ttgacgaact gggcagagga actagtactt tcgatggata cgccattgca tactcggttt 900 

ttcgtcacct ggtagagaaa gttcaatgtc ggatgctctt tgcaacacat taccaccctc 960 

tcaccaagga attcgcgtct cacccacgtg tcacctcgaa acacatggct tgcgcattca 102 0 

aatcaagatc tgattatcaa ccacgtggtt gtgatcaaga cctagtgttc ttgtaccgtt 1080 

taaccgaggg agcttgtcct gagagctacg gacttcaagt ggcactcatg gctggaatac 114 0 

caaaccaagt ggttgaaaca gcatcaggtg ctgctcaagc catgaagaga tcaattgggg 120 0 

aaaacttcaa gtcaagtgag ctaagatctg agttctcaag tctgcatgaa gactggctca 126 0 

agtcattggt gggtatttct cgagtcgccc acaacaatgc ccccattggc gaagatgact 132 0 

acgacacttt gttttgctta tggcatgaga tcaaatcctc ttactgtgtt cccaaataac 138 0 
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ccggg 13 8 5 



<210> 28 

<211> 34 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> MSH6 specific primer 2S8 for PCR using cDNA of Arabidopsis ; 
thaliana ecotype Columbia 

<400> 26 

atcccgggtt atttgggaac acagtaagag gatt 34 

<210> 29 

<211> 27 

<212> DNA 

<213> Artificial sequence 
<22C> 

<223> MSH6 specific primer S82 fcr PCR using cDNA of Arabidopsis 
thaliana ecotype Columbia 

<400> 29 

gcgttcgatc atcagcctct gtgttgc 27 



<210> 30 

<211> 3606 

<212> DNA 

<213> Arabidopsis thaliana ecotype Columbia 
<220> 

<221> CDS 

<222> (142) (3468) 

<223> AtMSH6 full-length cDNA and deduced sequence of the encoded 
polypeptide 

<400> 30 



aaaagttgag ccctgaggag tatigtttcc gccatttcta cgacgcaagg cgaaaatttt 6 0 
tggcgccaat ctttcccccc tttcgaattc tctcagctca aaacatcgtt tctctctcac 120 



tctctctcac aattccaaaa a atg cag cgc cag aga teg att ttg tct ttc 

Met Gin Arg Gin Arg Ser lie Leu Ser Phe 
15 10 



171 
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tec caa aaa ccc acc gcg gcg act acg aag ggt ttg gtt tec ggc gat 219 
Phe Gin Lys Pro Thr Ala Ala Thr Thr Lys Gly Leu Val Ser Gly Asp 

15 '20 25 

get get age ggc ggg ggc ggc age gga gga cca cga ttt aat gtg aag 267 
Ala Ala Ser Gly Gly Gly Gly Ser Gly Gly Pro Arg Phe Asn Val Arg 
30 35 40 

gaa ggg gat get aaa ggc gac get tct gta cgt ttt get gtt teg aaa 315 
Glu Gly Asp Ala Lys Gly Asp Ala Ser Val Arg Phe Ala Val Ser Lys 
45 50 55 

tct gtc gat gag gtt aga gga acg gat act cca ccg gag aag gtt ccg 363 
Ser Val Asp Glu Val Arg Gly Thr Asp Thr Pro Pro Glu Lys Val Pro 
60 65 70 

cgt cgt gtc ctg ccg tct gga ttt aag ccg get gaa tec gee gst gat 411 
Arg Arg Val Leu Pro Ser Gly Phe Lys Pro Ala Glu Ser Ala Gly Asp 
75 80 85 90 

get teg tec ctg ttc tec aac arc acg cat aag ttt gta aaa gtc gat 459 
Ala Ser Ser Leu Phe Ser Asr. He Met His Lys Phe Val Lys Val Asp 

95 100 105 

gat cga gat tgt tct gga gag age age cga gaa gat gtt gtt ccg ctg 507 
Asp Arg Asp Cys Ser Gly Glu Arg Ser Arg Glu Asp Val Val Pro Leu 
110 115 120 

aat gat tea tct eta tgt atg aag get aat gat gtt att cct caa ttt 555 
Asn Asp Ser Ser Leu Cys Met Lys Ala Asn Asp Val He Pro Gin Phe 
125 130 135 

cgt tec aat aat ggt aaa act caa gaa aga aac cat get ttt agt ttc 603 
Arg Ser Asn Asn Gly Lys Thr Gin Glu Arg Asn His Ala Phe Ser Phe 
140 145 150 

agt ggg aga get gaa ctt aga tea gta gaa gat ata gga gta gat ggc 651 
Ser Gly Arg Ala Glu Leu Arg Ser Val Glu Asp He Gly Val Asp Gly 
155 160 165 170 

gat gtt cct ggt cca gaa aca cca ggg atg cgt cca cgt get tct cgc 699 

Asp Val Pro Gly Pro Glu Thr Pro Gly Met Arg Pro Arg Ala Ser Arg 
175 180 185 

t 

ttg aag cga gtt ctg gag gat gaa atg act ttt aag gag gat aag gtt 747 
Leu Lys Arg Val Leu Glu Asp Glu Met Thr Phe Lys Glu Asp Lys Val 
190 195 200 



cct gta ttg gac tct aaq aaa agg ctg aaa atg etc cag gat ccg gtt 
Pro Val Leu Asp Ser Asn Lys Arg Leu Lys Met Leu Gin Asp Pro Val 
205 210 215 



795 
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tgt gga gag aag aaa gaa gta aac gaa gga acc aaa ttt gaa egg ctt 843 
Cys Gly Giu Lys Lys Glu Val Asn Glu Gly Thr Lys Phe Glu Trp Leu 
220 225 * 230 



gag tct cc: cga acc agg gac gec aat aga aga cgt cct gat gat ccc 891 
Glu Ser Ser Arg lie Arg Asp Ala Asn Arg Arg Arg Pro Asp Asp Pro 
235 240 245 250 



ctt tac gat aga aag acc tta cac ata 
Leu Tyr Asp Arg Lys Thr Leu His lie 
255 



cca cct gat gtt ttc aag aaa 939 
Pro Pro Asp Val Phe Lys Lys ; 
260 265 



atg tct gca tea caa aag caa tat tgg agt gtt aag agt gaa tat atg 987 
Met Ser Ala Ser Gin Lys Gin Tyr Trp Ser Val Lys Ser Glu Tyr Met 
270 275 280 



gac att gtg ctt ttc ttt aaa gtg ggg aaa ttt tat gag ctg tat gag 1035 
Asp lie Val Leu Phe Phe Lys Val Gly Lys Phe Tyr Glu Leu Tyr Glu 
295 290 295 



eta gat gcg gaa tta got cac aag gag ctt gac tgg aag atg acc atg 1083 
Leu Asp Ala Glu Leu Gly His Lys Glu Leu Asp Trp Lys Met Thr Met 
300 305 310 



agt ggt gtg gga aaa ngc aga cag gtz ggt ate tct gaa agt ggg ata 1131 
Ser Gly Val Gly Lys Cys Arg Gin Val Gly lie Ser Glu Ser Gly lie 
315 320 325 330 



gat gag gca gtg caa aag eta tta get cgt gga tat aaa gtt gga cga 1179 
Asp Glu Ala Val Gin Lys Leu Leu Ala Arg Gly Tyr Lys Val Gly Arg 
335 340 345 



ate gag cag eta gaa aca tct gac caa gca aaa gec aga ggt get aat 1227 
lie Glu Gin Leu Glu Thr Ser Asp Gin Ala Lys Ala Arg Gly Ala Asn 
350 355 360 

act ata att cca agg aag eta gtt cag gta tta act cca tea aca gca 1275 
Thr lie lie Pro Arg Lys Leu Val Gin Val Leu Thr Pro Ser Thr Ala 
365 370 375 



age gag gga aac ate ggg cct gat gec gtc cat ctt ctt get ata aaa 1323 

Ser Glu Gly Asn lie Gly Pro Asp Ala Val His Leu Leu Ala lie Lys 

380 385 390 

gag ate aaa atg gag eta caa aag tgt tea act gtg tat gga ttt get 1371 

Glu lie Lys Met Glu Leu Gin Lys Cys Ser Thr Val Tyr Gly Phe Ala 
395 400 405 410 

ttt gtt gac tgc get gc,c ttg agg ttt tgg gtt ggg tec ate age gat 1419 

Phe Val Asp Cys Ala Ala Leu Arg Phe Trp Val Gly Ser lie Ser Asp 
415 420 425 
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gat gca tea tgt get get etc gga gcg tta ttg atg cag gtt tct cca 1467 
Asp Ala Ser Cys Ala Ala Leu Gly Ala Leu Leu Met Gin Val Ser Pro 
430 435 * 440 



aag gaa gtg tta tat gac agt aaa ggg eta tea aga gaa gca caa aag 1515 
Lys Glu Val Leu Tyr Asp Ser Lys Gly Leu Ser Arg Glu Ala Gin Lys 
445 450 455 



get eta agg aaa tat acg ttg aca ggg tct acg gcg gta cag ttg get 1563 
Ala Leu Arg Lys Tyr Thr Leu Thr Gly Ser Thr Ala Val Gin Leu Ala ; 
460 465 470 



cca gta cca caa gta atg ggg gat aca gat get get gga gtt aga aat 1611 
Pro Val Pro Gin Val Met Gly Asp Thr Asp Ala Ala Gly Val Arg Asn 
475 480 485 490 



ata ata gaa tct aac gga tac ttt aaa 
He He Glu Ser Asn Gly Tyr Phe Lys 
495 



ggt tct tct gaa tea tgg aac 1659 
Gly Ser Ser Glu Ser Trp Asn 
500 505 



tgt get gtt gat ggt eta aat gaa tgt gat gtt gee ctt agt get ctt 1707 
Cys Ala Val Asp Giy Leu Asr. Glu Cys Asp Val Ala Leu Ser Ala Leu 
510 515 520 



gga gag eta att aat cat 
Gly Glu Leu He Asn His 
525 

aag cat ggg gat att ttt 
Lys His Gly Asp He Phe 
540 

att gat ggc cag acg atg 
lie Asp Gly Gin Thr Met 
555 560 



ctg tct agg eta aag eta 
Leu Ser Arg Leu Lys Leu 
530 

cca tac caa gtt tac agg 
Pro Tyr Gin Val Tyr Arg 
545 550 

gta aat ctt gag ata ttt 
Val Asn Leu Glu He Phe 
565 



gaa gat gta ctt 1755 

Glu Asp Val Leu 

535 

ggt tgt etc aga 1803 
Gly Cys Leu Arg 

aac aat age tgt 1851 
Asn Asn Ser Cys 
570 



gat ggt ggt cct tea ggg ace ttg tac aaa tat ctt gat aac tgt gtt 1899 
Asp Gly Gly Pro Ser Gly Thr Leu Tyr Lys Tyr Leu Asp Asn Cys Val 
575 580 585 



agt cca act ggt aag cga 
Ser Pro Thr Gly Lys Arg 
590 

aaa gat gta gaa age ate 
Lys Asp Val Glu Ser He 
605 

acg gca aac tea gaa agp 
Thr Ala Asn Ser Glu Ser 
620 



etc tta agg aat tgg ate 
Leu Leu Arg Asn Trp He 
595 

aat aaa egg ctt gat gta 
Asn Lys Arg Leu Asp Val 
610 

atg caa ate act ggc cag 
Met Gin He Thr Gly Gin 
625 630 



tgc cat cca etc 1947 
Cys His Pro Leu 
600 

gtt gaa gaa ttc 1995 

Val Glu Glu Phe 

615 

tat etc cac aaa 2043 
Tyr Leu His Lys 
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Leu Pro Asp 
635 

tea tea gee 
Ser Ser Ala 



caa cga gtt 
Gin Arg Val 



att gat ctg 
lie Asp Leu 
685 

ctt tat aaa 
Leu Tyr Lys 
700 

gag tta ttt 
Glu Leu ?he 
715 

aat tat cag 
Asn Tyr Gin 



ata ctt ate 
lie Leu lie 



att cac ace 
lie His Thr 
765 

gca agt etc 
Ala Ser Leu 
780 

tea gaa get 
Ser Glu Ala 
795 

ate caa gga 
He Gin Gly 



cct gtt ccg 
Pro Val Pro 



tta gaa aga 
Leu Glu Arg 
640 

tct gtg ttg 
Ser Val Leu 
655 

aaa gca ttt 

Lys Ala Phe 
670 

ttg ttg get 

Leu Leu Ala 



etc tgt aaa 

Leu Cys Lys 



ctt tct caa 
Leu Ser Gin 
720 

aac caa gat 
Asn Gin Asp 
735 

gaa ctt ttt 
Glu. Leu Phe 
750 

ata age tgc 
He Ser Cys 



tct get gga 
Ser Ala Gly 



aca gat cag 
Thr Asp Gin 
800 

eta tgg cat 
Leu Trp His 
815 

aat gat at t a 
Asn Asp He 
830 



ctg etc gga 
Leu Leu Gly 



cct get ctt 
Pro Ala Leu 



ggg caa att 
Gly Gin He 
675 

eta cag aag 
Leu Gin Lys 

690 

ctt cct ata 
Leu Pro He 
705 

tte gaa gca 
Phe Glu Ala 



gtg aca gat 
Val Thr Asp 



ate gaa aga 
lie Glu Arg 
755 

eta gat gtc 
Leu Asp Val 
770 

age atg gec 
Ser Met Ala 
785 

aat cag aaa 
Asn Gin Lys 



Oca ttt gca 
Pro Phe Ala 



etc ctt ggc 
Leu Leu Gly 
835 



26 

cgc ate aag 
Arg He Lys 
645 

ctg ggg aaa 
Leu Gly Lys 
660 

gtg aaa ggg 
Val Lys Gly 



gaa tea aat 
Glu Ser Asn 



tta gta gga 

Leu Val Gly 
710 

gec ata gat 
Ala He Asp 
725 

gaa aac get 
Glu Asn Ala 
740 

gca act caa 
Ala Thr Gin 



ctg aga tct 
Leu Arg Ser 



agg cct gtt 
Arg Pro Val 
790 

aca aaa ggg 
Thr Lys Gly 
805 

gtt gca gec 
Val Ala Ala 
820 

gag get aga 
Glu Ala Arg 



tct age gtt 
Ser Ser Val 



aaa gtg ctg 
Lys Val Leu 
665 

ttc aga agt 
Phe Arg Ser 
680 

atg atg agt 
Met Met Ser 

695 

aaa age ggg 
Lys Ser Gly 



age gac tct 
Ser Asp Phe 



gaa act etc 
Glu Thr Leu 
745 

tgg tct gag 
Trp Ser Glu 
760 

ttt gca ate 
Phe Ala lie 
775 

att ttt ecc 
He Phe Pro 



cca ata ctt 
Pro He Leu 



gat ggt caa 
Asp Gly Gin 
825 

aga age agt 
Arg Ser Ser 
840 



cga 2091 

Arg 

650 

aaa 213 9 

Lys 



gga 2187 
Gly 



ttg 2235 
Leu 



eta 2283 
Leu 



cca 2331 

Pro 

730 

aca 2379 
Thr 



gtc 2427 
Val 



gca 2475 
Ala 



gaa 2523 
Glu 



aaa 2571 

Lys 

810 

ttg 2619 
Leu 



ggc 2667 
Gly 
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age act cat cct egg tea ttg tea ccg'acg gga cca aac atg ggc gga 2715 
Ser lie His Pro Arg Ser Leu Leu Leu Thr Gly Pro Asn Met Gly Gly 
845 350 * 855 

aaa tea act ctt ctt cgt gca aca tgt ctg gee gtt ate ttt gee caa 2763 
Lys Ser Thr Leu Leu Arg Ala Thr Cys Leu Ala Val lie Phe Ala Gin 
860 865 870 

ctt ggc tgc tac gtg ccg tgt gag tct tgc gaa ate tec etc gtg gat 2811 
Leu Gly Cys Tyr Val Pro Cys Glu Ser Cys Glu lie Ser Leu Val Asp 
875 880 885 890 

act ate ttc aca agg ctt ggc gca tct gat aga ate atg aca gga gag 2859 
Thr lie Phe Thr Arg Leu Gly Ala Ser Asp Arg lie Met Thr Gly Glu 
895 900 905 

agt ace ttt ttg gta gaa tgc act gag aca gcg tea gtt ctt cag aat 2907 
Ser Thr Phe Leu Val Glu Cys Thr Glu Thr Ala Ser Val Leu Gin Asn 
910 915 920 

gca act cag gat tea eta gta ate etc gac gaa ctg ggc aga gga act 2955 
Ala Thr Gin Asp Ser Leu Val He Leu Asp Glu Leu Gly Arg Gly Thr 
925 930 935 

agt act ttc gat gga cac gee acc gca cac teg gtt ttt cgt cac ctg 3003 
Ser Thr Phe Asp Gly Tyr Ala lie Ala Tyr Ser Val Phe Arg His Leu 
940 945 950 

gta gag aaa gtt caa tgt egg atg etc ttt gca aca cat tac cac cct 3051 
Val Glu Lys Val Gin Cys Arg Met Leu Phe Ala Thr His Tyr Kis Pro 
955 960 965 970 

etc acc aag gaa ttc gcg tct cac cca cgt gtc acc ccg aaa cac atg 3099 
Leu Thr Lys Glu Phe Ala Ser His Pro Arg Val Thr Ser Lys His Met 
975 980 985 

get tgc gca ttc aaa tea aga tct gat tat caa cca cgt ggt tgt gat 3147 
Ala Cys Ala Phe Lys Ser Arg Ser Asp Tyr Gin Pro Arg Gly Cys Asp 
990 995 1000 

caa gac eta gtg ttc ttg tac cgt tta acc gag gga get tgt cct gag 3195 
Gin Asp Leu Val Phe Leu Tyr Arg Leu Thr Glu Gly Ala Cys Pro Glu 
1005 1010 1015 

age tac gga ctt caa gtg gca etc atg get gga ata cca aac caa gtg 3243 
Ser Tyr Gly Leu Gin Val Ala Leu Met Ala Gly lie Pro Asn Gin Val 
1020 1025 1030 



gtt gaa aca gca tea ggt^ get get caa gee atg aag aga tea att ggg 
val Glu Thr Ala Ser Gly Ala Ala Gin Ala Met Lys Arg Ser lie Gly 
1035 1040 1045 1050 



3291 
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gga aac ttc aag tea agt gag eta aga tec gag ttc tea agt ctg cat 3339 

Glu Asn Phe Lys Ser Ser Glu Leu Arg Ser Glu Phe Ser Ser Leu His 
1055 1060 1065 

gaa gac tgg etc aag tea ttg gtg ggt att tct cga gtc gee cac aac 3387 
Glu Asp Trp Leu Lys Ser Leu Val Gly lie Ser Arg Val Ala His Asn 
1070 1075 1080 

aat gec ccc att ggc gaa gat gac tac gac act ttg ttt tgc tta tgg 3435 
Asn Ala Pro lie Gly Glu Asp Asp Tyr Asp Thr Leu Phe Cys Leu Trp ; 
1085 1090 1095 

cat gag ate aaa tec tet tac tgt gtt ccc aaa taaatggcta 3478 
His Glu lie Lys Ser Ser Tyr Cys Val Pro Lys 
1100 1105 

tgacataaca ctatctgaag ctcgttaagt cttttgcctc tctgatgttt attcctctta 3538 

aaaaatgett atatatcaaa aaattg:::c ctcgattaaa aaaaaaaaaa aaaaaaaaaa 3598 

aaaaaaaa 3606 



<210> 31 

<211> 1109 

<212> PRT 

<213> Arabidopsis thaliana ecotype Columbia 

<223> Polypeptide MSH6 

<400> 31 

Met Gin Arg Gin Arg Ser lie Leu Ser Phe Phe Gin Lys Pro Thr Ala 
15 10 15 

Ala Thr Thr Lys Gly Leu Val Ser Gly Asp Ala Ala Ser Gly Gly Gly 
20 25 30 

Gly Ser Gly Gly Pro Arg Phe Asn Val Arg Glu Gly Asp Ala Lys Gly 
35 40 45 

Asp Ala Ser Val Arg Phe Ala Val Ser Lys Ser Val Asp Glu Val Arg 
50 55 60 

Gly Thr Asp Thr Pro Pro Glu Lys Val Pro Arg Arg Val Leu Pro Ser 
65 70 ' 75 80 

Gly Phe Lys Pro Ala Glu Ser Ala Gly Asp Ala Ser Ser Leu Phe Ser 

85 90 95 

Asn lie Met His Lys Phe Val Lys Val Asp Asp Arg Asp Cys Ser Gly 
100 105 110 



Glu Arg Ser Arg Glu Asp Val Val Pro Leu Asn Asp Ser Ser Leu Cys 
115 120 125 
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Met Lys Ala Asn Asp Val lie Pro Gin Phe Arg Ser Asn Asn Gly Lys 
130 135 * 140 

Thr Gin Glu Arg Asn His Ala Phe Ser Phe Ser Gly Arg Ala Glu Leu 
145 150 155 160 

Arg Ser Val Glu Asp He Gly Val Asp Gly Asp Val Pro Gly Pro Glu 
165 170 175 

Thr Pro Gly Met Arg Pro Arg Ala Ser Arg Leu Lys Arg Val Leu Glu 
180 185 190 

Asp Glu Met Thr Phe Lys Glu Asp Lys Val Pro Val Leu Asp Ser Asn 
195 200 205 

Lys Arg Leu Lys Met Leu Gin Asp Pro Val Cys Gly Glu Lys Lys Glu 
210 215 220 

Val Asn Glu Gly Thr Lys Phe Glu Trp Leu Glu Ser Ser Arg He Arg 
225 230 235 240 

Asp Ala Asn Arg Arg Arg Pro Asp Asp Pro Leu Tyr Asp Arg Lys Thr 
245 250 255 

Leu His He Pro Pro Asp Val Phe Lys Lys Met Ser Ala Ser Gin Lys 
260 265 270 

Gin Tyr Trp Ser Val Lys Ser Glu Tyr Met Asp He Val Leu Phe Phe 
275 280 285 

Lys Val Gly Lys Phe Tyr Glu Leu Tyr Glu Leu Asp Ala Glu Leu Gly 
290 295 300 

His Lys Glu Leu Asp Trp Lys Met Thr Met Ser Gly Val Gly Lys Cys 
305 310 315 320 

Arg Gin Val Gly He Ser Glu Ser Gly He Asp Glu Ala Val Gin Lys 
325 330 335 

Leu Leu Ala Arg Gly Tyr Lys Val Gly Arg He Glu Gin Leu Glu Thr 
340 345 350 

Ser Asp Gin Ala Lys Ala Arg Gly Ala Asn Thr He He Pro Arg Lys 
355 ' 360 365 

Leu Val Gin Val Leu Thr Pro Ser Thr Ala Ser Glu Gly Asn He Gly 
370 375 380 

v 

Pro Asp Ala Val His Leu Leu Ala He Lys Glu lie Lys Met Glu Leu 
385 390 395 400 



Gin Lys Cys Ser Thr Val Tyr Gly Phe Ala Phe Val Asp Cys Ala Ala 
405 410 415 
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Leu Arg Phe Trp Val Gly Ser He Ser Asp Asp Ala Ser Cys Ala Ala 
420 425- 430 

Leu Gly Ala Leu Leu Met Gin Val Ser Pro Lys Glu Val Leu Tyr Asp 
435 440 445 

Ser Lys Gly Leu Ser Arg Glu Ala Gin Lys Ala Leu Arg Lys Tyr Thr 
450 455 4 6 o 

Leu Thr Gly Ser Thr Ala Val Gin Leu Ala Pro Val Pro Gin Val Met 
465 470 475 480 

Gly Asp Thr Asp Ala Ala Gly Val Arg Asn He lie Glu Ser Asn Gly 
485 490 495 

Tyr Phe Lys Gly Ser Ser Glu Ser Trp Asn Cys Ala Val Asp Gly Leu 
500 505 510 

Asn Glu Cys Asp Val Ala Leu Ser Ala Leu Gly Glu Leu He Asn His 
515 520 525 

Leu Ser Arg Leu Lys Leu Glu Asp Val Leu Lys His Glv Asp He Phe 
530 535 540 

Pro Tyr Gin Val Tyr Arg Gly Cys Leu Arg He Asp Gly Gin Thr Met 
545 550 555 560 

Val Asn Leu Glu He Phe Asn Asn Ser Cys Asp Gly Gly Pro Ser Gly 
565 570 575 

Thr Leu Tyr Lys Tyr Leu Asp Asn Cys Val Ser Pro Thr Gly Lys Arg 
580 585 590 

Leu Leu Arg Asn Trp He Cys His Pro Leu Lys Asp Val Glu Ser He 
595 600 6 05 

Asn Lys Arg Leu Asp Val Val Glu Glu Phe Thr Ala Asn Ser Glu Ser 
610 615 620 

Met Gin He Thr Gly Gin Tyr Leu His Lys Leu Pro Asp Leu Glu Arg 
625 630 635 640 

Leu Leu Gly Arg He Lys Ser Ser Val Arg Ser Ser Ala Ser Val Leu 
64 5 •' 650 655 

Pro Ala Leu Leu Gly Lys Lys Val Leu Lys Gin Arg Val Lys Ala Phe 
660 665 670 

Gly Gin He Val Lys Gljf Phe Arg Ser Gly lie Asp Leu Leu Leu Ala 
675 680 685 

Leu Gin Lys Glu Ser Asn Met Met Ser Leu Leu Tyr Lys Leu Cys Lys 
690 695 700 
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Leu Pro lie Leu Val Gly Lys Ser Gly Leu Glu Leu Phe Leu Ser Gin 
705 710 * 715 720 

Phe Glu Ala Ala lie Asp Ser Asp Phe Pro Asn Tyr Gin Asn Gin Asp 
725 730 735 

Val Thr Asp Glu Asn Ala Glu Thr Leu Thr lie Leu lie Glu Leu Phe 
740 745 750 

lie Glu Arg Ala Thr Gin Trp Ser Glu Val lie His Thr lie Ser Cys 
755 760 765 

Leu Asp Val Leu Arg Ser Phe Ala lie Ala Ala Ser Leu Ser Ala Gly 
770 775 780 

Ser Met Ala Arg Pro Val lie Phe Pro Glu Ser Glu Ala Thr Asp Gin 
785 790 795 800 

Asn Gin Lys Thr Lys Gly Pro lie Leu Lys lie Gin Gly Leu Trp His 
805 810 815 

Pro Phe Ala Val Ala Ala Asp Gly Gin Leu Pro Val Pro Asn Asp lie 
820 825 830 

Leu Leu Gly Glu Ala Arg Arg Ser Ser Gly Ser lie His Pro Arg Ser 
835 840 845 

Leu Leu Leu Thr Gly Pro Asn Met: Gly Gly Lys Ser Thr Leu Leu Arg 
850 855 860 

Ala Thr Cys Leu Ala Val He Phe Ala Gin Leu Gly Cys Tyr Val Pro 
865 870 875 880 

Cys Glu Ser Cys Glu He Ser Leu Val Asp Thr He Phe Thr Arg Leu 
885 890 895 

Gly Ala Ser Asp Arg He Met Thr Gly Glu Ser Thr Phe Leu Val Glu 
900 905 910 

Cys Thr Glu Thr Ala Ser Val Leu Gin Asn Ala Thr Gin Asp Ser Leu 
915 920 925 

Val He Leu Asp Glu Leu Gly Arg Gly Thr Ser Thr Phe Asp Gly Tyr 
930 935 940 

Ala lie Ala Tyr Ser Val Phe Arg His Leu Val Glu Lys Val Gin Cys 
945 950 955 960 

Arg Met Leu Phe Ala Thr His Tyr His Pro Leu Thr Lys Glu Phe Ala 
965 970 975 



Ser His Pro Arg Val Thr Ser Lys His Met Ala Cys Ala Phe Lys Ser 
980 985 990 
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Arg Ser Asp Tyr Gin Pro Arg Gly Cys Asp Gin Asp Leu Val Phe Leu 
995 1000 * 1005 

Tyr Arg Leu Thr Glu Gly Ala Cys Pro Glu Ser Tyr Gly Leu Gin Val 
1010 1015 1020 

Ala Leu Met Ala Gly He Pro Asn Gin Val Val Glu Thr Ala Ser Gly 
1025 1030 1035 1040 

Ala Ala Gin 'Ala Met Lys Arg Ser He Gly Glu Asn Phe Lys Ser Ser 
1045 1050 1055 

Glu Leu Arg Ser Glu Phe Ser Ser Leu His Glu Asp Trp Leu Lys Ser 
1060 1065 1070 

Leu Val Gly He Ser Arg Val Ala His Asn Asn Ala Pro He Gly Glu 
1075 1080 1085 

Asp Asp Tyr Asp Thr Leu Phe Cys Leu Trp His Glu He Lys Ser Ser 
1090 1095 1100 

Tyr Cys Val' Pro Lys 
1105 



<210> 32 

<211> 24 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Forward primer for PCR amplification of ATHGENEA 
microsatellite 

<400> 32 



accatgcata gcttaaactt cttg 

33 
22 
DNA 

Artificial sequence 

j 

Reverse primer for PCR amplification of ATHGENEA 
microsatellite 

33 ; 

acataaccac aaataggggt gc 



<210> 
<211> 
<212> 
<213> 

<220> 
<223> 



<400> 
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<210> 34 
<211> 18 
<212> DNA 

<213> Artificial sequence 

<220> 

<223> Forward primer DMCIN-A for PCR on genomic DNA of Arabidopsis 

thaliana ssp. Landsberg erecta "Ler" 

<400> 34 

gaagcgatat tgttcgtg 18 

<210> 35 

<211> 18 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Reverse primer DMCIN-B for PCR on genomic DNA of Arabidopsis 
thaliana ssp. Landsberg erecta "Ler" 

<400> 35 

agatcgcgac aacattcc 18 

<210> 36 

<211> 31 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Forward primer DMCIN-1 for PCR on genomic DNA of Arabidopsis 
thaliana ssp. Landsberg erecta "Ler" 

<400> 36 

acgcgtcgac tcagctatga gattactcgt g 31 

<210> 37 

<211> 29 

<212> DNA , 

<213> Artificial sequence 

<220> 

<223> Reverse primer DMCIN-2 for PCR on genomic DNA of Arabidopsis 

thaliana s;sp. Landsberg erecta "Ler" 

<400> 37 



gctctagatt tctcgctcta agactctct 



29 
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38 



<211> 32 

<212> DNA 

<213> Artificial sequence 
<220> 

<22 3> Forward primer DMCIN-3 for PCR on genomic DNA of Arabidopsis 
thaliana ssp. Landsberg erecta "Ler" . 

<400> 38 

gctctagagc ttctcttaag taagtgattg at 32 

<210> 39 

<211> 48 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Reverse primer DMCIN-4 for PCR on genomic DNA of Arabidopsis 
thaliana ssp. Landsberg erecta "Ler" 

<400> 39 

tcccccgggc tcgagagatc tccatggttt cttcagctct atgaatcc 4 8 

<210> 40 

<211> 26 

<212> DNA 

<213> Artificial sequence 
<220> 

<22 3> Forward primer DMCla for PCR on genomic DNA of Arabidopsis 
thaliana ssp. Landsberg erecta "Ler" 

<400> 40 

acgcgtcgac gaattcgcaa gtgggg 2 6 



<210> 41 t 

<211> 38 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Reverse primer DMClb for PCR on genomic DNA of Arabidopsis 
thaliana ssp. Landsberg erecta "Ler" 



<400> 



41 
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tccacggaga tctcccgggt accgatttgc ttcgaggg 38 

<210> 42 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Forward primer for PCR amplification of ATEAT1 SSLP marker in 
Arabidopsis thaliana subspecies 

<400> 42 

gccactgcgt gaatgatatg 20 

<210> 43 

<211> 22 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Reverse primer for PCR amplification of ATE ATI SSLP marker in 
Arabidopsis thaliana subspecies 

<400> 43 

cgaacagcca acattaattc cc 22 

<210> 44 

<211> 18 

<212> DNA 

<213> Artificial sequence 
<220> 

<222> Forward primer for PCR amplification of NGA63 SSLP marker in 
Arabidopsis thaliana subspecies 

<400> 44 

aaccaaggca cagaagcg 18 



<210> 45 

<211> 18 

<212> DNA 

<213> Artificial'' sequence 
<220> 

<223> Reverse primer for PCR amplification of NGA63 SSLP marker in 
Arabidopsis thaliana subspecies 
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<400> 45 

acccaagtga tcgccacc 18 

<210> 46 

<211> 21 

<212> DNA 

<213> Artificial sequence ; 
<220> 

<223> Forward primer for PCR amplification of NGA248 SSLP marker in 
Arabidopsis thaliana subspecies 

<400> 46 

taccgaacca aaacacaaag g 21 

<210> 47 

<211> 22 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Reverse primer for PCR amplification of NGA248 SSLP marker in 
Arabidopsis thaliana subspecies 

<400> 47 

tctgtatctc ggtgaattct cc 22 

<210> 48 

<211> 22 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Forward primer for PCR amplification of NGA128 SSLP marker in 
Arabidopsis thaliana subspecies 

<400> 48 

ggtctgttga tgtcgtaagt eg 22 



<210> 49 

<211> 22 '; 

<212> DNA 

<213> Artificial sequence 



<220> 



WO 99/19492 



PCT/EP98/06977 



37 



<22 3 > 


acvciac pzrimeir loi. itv^k ampi x r icac ion or 
Arabidopsis thaliana subspecies 


NGA12 8 


SSLP 


marker in 


^ j no s 










atcttgaaac 


ctttagggag gg 






22 


<210> 


50 








<211> 


22 








<212> 


DNA 








<213> 


Artificial sequence 








<220> 










<22 3 > 


roirwaira primer ror rtK amplification or 
Arabidopsis thaliana subspecies 


NGA2 8 0 


SSLP 


marker in 


<4 00 > 


50 








ctgatctcac 


ggacaatagt gc 






22 


<210> 


51 








<211> 


20 








<212> 


DNA 








<213> 


Artificial sequence 








<220> 










< ^ J > 


Reverse primer for PCR amplification of 
Arabidopsis thaliana subspecies 


NGA2 8 0 


SSLP 


marker in 


v yl A A v 

< ft u u > 


51 








ggctccataa 


aaagtgcacc 






20 


<210> 


52 








<211> 


21 








<212> 


DNA 








<213> 


Artificial sequence 








<220> 










<223> 


Forward primer for PCR amplification of 
Arabidopsis thaliana subspecies 


NGA111 


SSLP 


marker in 


<400> 


52 








ctccagttgg 


aagctaaagg g 






21 



<210> 53 

<211> 21 

<212> DNA 

<213> Artificial seauence 
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!23> Reverse primer for PCR .amplification of NGA111 SSLP marker in 

Arabidopsis thaliana subspecies 



<400> 


53 


tgttttttag 


gacaaatggc g 


<210> 


54 


<211> 


2 0 


<212> 


DNA 


< 2 13 > 


HrLiiiciai sequence 


<r 7 7 fl 




<223> 


Forward primer for PCR amplification 
Arabidopsis thaliana subspecies 


<400> 


54 


ccttcacatc 


caaaacccac 


<210> 


55 


<211> 


20 


<212> 


DNA 


<213> 


Artificial sequence 


<220> 




<223> 


Reverse primer for PCR amplification 
Arabidopsis thaliana subspecies 


<400> 


55 



21 



20 



gcacataccc acaaccagaa 20 



<210> 56 

<211> 20 

<212> DNA 

<213> Artificial sequence 

<220> 

<223> Forward primer for PCR amplification of NGA1126 SSLP marker 
in Arabidopsis thaliana subspecies 

<400> 56 



cgctacgctt ttcggtaaag 



20 



WO 99/19492 



PCT/EP98/06977 



39 

<210> 57 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Reverse primer for PCR amplification of NGA1126 SSLP marker 
in Arabidopsis thaliana subspecies 

<400> 57 

gcacagtcca agtcacaacc 20 

<210> 58 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Forward primer for PCR amplification of NGA3 61 SSLP marker in 
Arabidopsis thaliana subspecies 

<400> 58 

aaagagatga gaatttggac 20 

<210> 59 

<211> 23 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Reverse primer for PCR amplification of NGA361 SSLP marker in 
Arabidopsis thaliana subspecies 

<400> 59 

acatatcaat atattaaagt age 23 

<210> 60 

<2ll> 18 * 

<212> DNA 

<213> Artificial sequence 

<220> 

<223> Forward primer for PCR amplification of NGA168 SSLP marker in 
Arabidopsis thaliana subspecies 



<400> 



60 
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tcgtctactg cactgccg 18 

<210> 61 

<211> 22 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Reverse primer for PCR amplification of NGA168 SSLP marker, in 
Arabidopsis thaliana subspecies 

<400> 61 

gaggacatgt ataggagcct eg 2 2 

<210> 62 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Forward primer for PCR amplification of AthBI02 SSL? marker 
in Arabidopsis thaliana subspecies 

<400> 62 

tgacctcctc ttccatggag 20 

<210> 63 

<211> 22 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Reverse primer for PCR amplification of AthBI02 SSL? marker 
in Arabidopsis thaliana subspecies 

<400> 63 

ttaacagaaa cccaaagctt tc 22 



<210> 64 
<211> 21 
<212> DNA 

<213> Artificial sequence 

v 

<220> 

<223> Forward primer for PCR amplification of AthUBIQUE SSLP marker 

in Arabidopsis thaliana subspecies 
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<400> 64 

aggcaaacgc ccatttcatt g ^ 2i 

<210> 65 

<211> 20 

<212> DNA 

<213> Artificial sequence 

<220> : 

<223> Reverse primer for PCR amplification of AthUBIQUE SSLP marker 

in Arabidopsis thaliana subspecies 

<400> 65 

acgacatggc agatttctcc 20 



<210> 66 

<211> 21 

<212> DNA 

<213> Artificial seer. 



<220> 

<223> Forward primer for PCR amplification of NGA172 SSLP marker in 

Arabidopsis thaliana subspecies 

<400> 66 

agctgcttcc ttatagcgtc c 21 

<210> 67 

<211> 19 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Reverse primer for PCR amplification of NGA172 SSLP marker in 
Arabidopsis thaliana subspecies 

<400> 67 

catccgaatg ccattgttc » 19 



<210> 68 

<211> 21 

<212> DNA 

<213> Artificial sequence 



<220> 
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<223> Forward primer for PCR amplif icacion of NGA126 SSLP marker in 

Arabidopsis chaliana subspecies 

•-4 00> 6 3 

gaaaaaacgc tactttcgtg g 21 

<210> 69 

<211> 22 

<212> DNA 

<213> Artificial sequence 
<220> 

<223 > Reverse primer for PCR amplification of NGA126 SSLP marker in 
Arabidopsis thaliana subspecies 

<400> 69 

caagagcaat atcaagagca gc 22 

<210> 70 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Forward primer for PCR amplification of NGA162 SSLP marker in 
Arabidopsis thaliana subspecies 

<400> 70 

catgcaattt gcatctgagg 20 

<210> 71 
<211> 22 
<212> DNA 

<213> Artificial sequence 

<220> 

<223> Reverse primer for PCR amplification of NGA162 SSLP marker in 

Arabidopsis thaliana subspecies 
j 

<400> 71 

ctctgtcact cttttcctct gg 22 



<210> 72 

<211> 21 

<212> DNA 

<213> Artificial sequence 
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<220> 

<223> Forward primer for PCR. amplification of NGA6 SSLP marker in 

Arabidopsis thaliana subspecies 

<400> 72 

tggatttctt cctctcttca c 21 

<210> 73 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Reverse primer for PCR amplification of NGA6 SSLP marker in 
Arabidopsis thaliana subspecies 

<400> 73 

atggagaagc ttacactgat c 21 

<210> 74 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Forward primer for PCR amplification of NGA12 SSLP marker in 
Arabidopsis thaliana subspecies 

<400> 74 

aatgttgtcc tcccctcctc 20 



<210> 75 

<211> 22 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Reverse primter for PCR amplification of NGA12 SSLP marker in 
Arabidopsis thaliana subspecies 

<400> 75 



tgatgctctc tgaaacaaga cjc 



22 
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<210> 76 
<211> 21 
<212> DNA 

<213> Artificial sequence 

<220> 

<223> Forward primer for PGR amplification of NGA8 SSLP marker in 

Arabidopsis thaliana subspecies 

<400> 76 

gagggcaaat ctttatttcg g 21 

<210> 77 

<211> 22 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Reverse primer for PGR amplification of NGA8 SSLP marker in 
Arabidopsis thaliana subspecies 

<400> 77 

tggctttcgt ttataaacat cc 22 

<210> 78 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Forward primer for PGR amplification of NGA1107 SSLP marker 
in Arabidopsis thaliana subspecies 

<400> 78 

gcgaaaaaac aaaaaaatcc a 21 

<210> 79 

<211> 21 , 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Reverse primer for PCR amplification of NGA1107 SSLP marker 
in Arabidopsis thaliana subspecies 



<400> 



79 
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45 


cgacgaatcg 


acagaattag g 


<210> 


80 


<211> 


21 


<212> 


DNA 


<213> 


ArT 1 f i ri fll cam mnr-a 


<220> 




<223> 


Forward primer for PCR amplification 
Arabidopsis thaliana subspecies 


<400> 


80 


gaaatccaaa 


tcccagagag g 


<210> 


81 


<211> 


22 


<212> 


DNA 


<213 > 


Artificial sequence 


^- O O pi 




<223> 


Reverse primer for PCR amplification 
Arabidopsis thaliana subspecies 


<400> 


81 


tctccccact 


agttttgtgt cc 


<210> 


82 


<211> 


19 


<212> 


DNA 


<213> 


Artificial sequence 


<220> 




<223> 


Forward primer for PCR amplification 
Arabidopsis thaliana subspecies 



21 



21 



marker in 



22 



<400> 82 
taccgtcaat ttcatcgcc 



<210> 83 

<211> 22 

<212> DNA 

<213> Artificial sequence 



19 



<220> 

<223> Reverse primer for PCR amplification of NGA24 9 SSLP marker in 

Arabidopsis thaliana subspecies 
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<400> 83 

ggatccctaa ctgcaaaacc cc . 22 

<210> 84 

<211> 22 

<212> DNA 

<213> Artificial sequence 

<220> ; 

<223> Forward primer for PCR amplification of CA72 SSLP marker in 
Arabidopsis thaliana subspecies 

<400> 84 

aatcccagta accaaacaca ca 22 

<210> 85 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Reverse primer for PCR amplification of CA72 SSLP marker in 
Arabidopsis thaliana subspecies 

<400> 85 

cccagtctaa ccacgaccac 20 

<210> 86 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Forward primer for PCR amplification of NGA151 SSLP marker in 
Arabidopsis thaliana subspecies 

<400> 86 

gttttgggaa gttttgctgg , 20 



<210> 87 

<211> 24 

<212> DNA 

<213> Artificial sequence 



<220> 
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<223> Reverse primer for PCR amplification of NGA151 SSLP marker in 

Arabidopsis chaliana subspecies 

<400> 87 

cagtctaaaa gcgagagtat gatg 24 

<210> 88 

<211> 22 

<212> DNA 

<213> Artificial sequence 
<220> 

<222> Forward primer for PCR amplification of NGA106 SSLP marker in 
Arabidopsis thaliana subspecies 

<400> 88 

gttatggagt ttctagggca eg 22 

<210> 89 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Reverse primer for PCR amplification of NGA106 SSLP marker in 
Arabidopsis thaliana subspecies 

<400> 89 

tgccccattt cgttcttctc 20 

<210> 90 

<211> 20 

<212> DMA 

<213> Artificial sequence 
<220> 

<223> Forward primer for . PCR amplification of NGA139 SSLP marker in 
Arabidopsis thaliana subspecies 

<400> 90 

agagctacca gatccgatgg 20* 



<210> 91 

<211> 21 

<212> DNA 

<213> Artificial sequence 
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<220> 

<223> Reverse primer for PCR amplification of NGA13 9 SSLP marker in 
Arabidopsis thaliana subspecies 

<400> 91 

ggtttcgttt cactatccag g 21 

<210> 92 

<211> 22 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Forward primer for PCR amplification of NGA76 SSLP marker in 
Arabidopsis thaliana subspecies 

<400> 92 

ggagaaaatg tcactctcca cc 22 

<210> 93 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Reverse primer for PCR amplification of NGA7 6 SSLP marker in 
Arabidopsis thaliana subspecies 

<400> 93 

aggcatggga gacatttacg 20 

<210> 94 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Forward primer for PCR amplification of ATHS0191 SSLP marker 
in Arabidopsis thaliana subspecies 

<400> 94 

ctccaccaat catgcaaatg *' 2 0 
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<210> 95 
<211> 21 
<212> DNA 

<213> Artificial sequence 

<220> 

<223> Reverse primer for PCR amplification of ATHS0191 SSLP marker 

in Arabidopsis thaliana subspecies 

<400> 95 

tgatgttgat ggagatggtc a 21 

<210> 96 

<211> 22 

<212> DNA 

<213> Artificial sequence 
<220> 

< 223 > Forward primer for PCR amplification of NGA129 SSLP marker in 
Arabidopsis thaliana subspecies 

<400> 96 

tcaggaggaa ctaaagtgag gg 22 

<210> 97 

<211> 22 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Reverse primer for PCR amplification of NGA129 SSLP marker in 
Arabidopsis thaliana subspecies 

<400> 97 

cacactgaag atggtcttga gg 22 



<210> 98 

<211> 8062 

<212> DNA 

<213> Arabidopsis thaliana ecotype Columbia 
<220> 

< 22 3> Genomic DNA sequence of AtMSH6 

<400> 97 



ttttttggtt gctaacaata aaggtatacg gttttatgtc atcaatataa ctatatataa 6 0 
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aagaaatgaa agatatatat tgttttttca tttatcaaac aaaacaacaa gaccttcttt 120 

ttacttttta cattggtcaa caaaatacaa gataaacgac accgct taat cacctcccaa 180 

ttttacccct aagtttaaca cctagaacct tctccatctt cgcaagcaca gcctgattag 240 

gaacagcttt accatcctca tattcctgaa ctacctgagt cctcccattg atctgtttcg 300 

ccaaatccgc ttgtgacatc ttcttctcca atctcgcttt ctgcatcatc aacctcacct 360 

ctgctttcac acgatccatc gccgcaggct ctgtttcttc ctccagcttc ttcgtgttaa 420 

tcaccggaac cgccgtagat tccccctttt tgttcgaacc ggcatcgaat ttcttaaccg 480 

tttgaaccgc gacaccgttt ctcagagctg cgttaaccgc tcccggatcg cgtaggtctt 540 

ggctcttttg ttttgatttg tggagaacta ctggttccca gtcttgtgtt actgctcctg 600 

ggtatctgct cggcatcgcc gatgaattga gagaaaggaa caacgcgaaa attttattaa 660 

tctgagtctt gaaattgaga aacgatgaag atgaagaacg crgtcgagag gattgtgata 720 

tttatatata cgaagactgg ttcccggaga atccgatcac ctirctcctcc attctcgtct 780 

ctggaacgtc cttagagatg actgacgacg tgtcactacc cgacctgcag ctaaccaatg 840 

ctttttgggt tggattcgtg gtacaccata ctacccgatc cggcccaatg gttttatata 900 

aatttggttt tcggttcggt tatgagttat catcaaaact aagccaacca aaaattttcg 960 

taaaatttat ttcggtttca attcggatcc cttacttcca gaaccgaatt attcgaaacc 1020 

ggggttagcc gaaccgaata ccaatgcctg attgactcgc cggccagaaa gatccaacgg 1080 

tatacaataa tagaacataa atcggacggt catcaaagcc tcaaagagtg aacagtcaac 1140 

aaaaaaagtt gagccctgag gagtatcgtt tccgccattt ctacgacgca aggcgaaaat 1200 

ttttggcgcc aatctttccc ccctttcgaa ttctctcagc tcaaaacatc gttcctctct 1260 

cactctctct cacaattcca aaaaacgcag cgccagagat cgattttgtc tttcttccaa 1320 

aaacccacgg cggcgactac gaagggtttg gtttccggcg atgccgctag cggcgggggc 1380 

ggcagcggag accacgattt aatgtgaagg aaggggatgc caaaggcgac gcttctgcac 1440 

gttttgctgt ttcgaaatct gtcgatgagg ttagaggaac ggatactcca ccggagaagg 1500 

ttccgcgtcg tgtcctgccg tctggattta agccggctga atccgccggt gatgcttcgt 1560 

ccctgctctc caatattatg cataagtttg taaaagtcga tgatcgagat tgttctggag 1620 

agaggtacta atcttcgatt ctctcaattt tgccatcttt agccggaaga agaagattcg 168 0 
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tgtaatttgt 


tgtattcgtt 


ggagagat tc 


tgattactgc 


attggatcgt 


tgtttacaaa 


1740 


ttttcaggag 


ccgagaagat 


gttgttccgc 


tgaacgatcc 


atctctatgt 


aegaaggeca 


1800 


atgatgttat 


tcctcaatct 


cgttccaaca 


atggtaaaac 


tcaagaaaga 


aaccatgett 


1860 


ttagtttcag 


tgggagagct 


gaactcagat 


cagtagaaga 


tataggagta 


gatggcgatg 


1920 


Ctcctggtcc 


agaaacacca 


gggatgegee 


cacgtgcttc 


tegcttgaag 


cgagttctgg 


1980 


aggatgaaat 


gacttttaag 


gaggataagg 


ttcctgtatt 


ggactctaac 


aaaaggctga 


2040 


aaatgctcca 


ggatccggtt 


tgtggagaga 


agaaagaagt 


aaacgaagga 


accaaatttg 


2100 


aatggcttga 


gtcttctcga 


atcagggatg 


ccaatagaag 


aegtcctgat 


gatccccttt 


2160 


acgatagaaa 


gaccttacac 


ataccacctg 


atgttttcaa 


gaaaatgtct 


gcatcacaaa 


2220 


agcaatattg 


gagtgttaag 


agegaatata 


tggacattgt 


gcttttcttt 


aaagtggtta 


2280 


gtaactatta 


acctagtgct 


caatccatct 


ccccaatgcg 


atctgttcac 


ttacatctgt 


23 40 


ttacgttatg 


cccctcccag 


gggaaattt t 


atgagctgta 


tgagecagat 


geggaattag 


2400 


gtcacaagga 


get tgactgg 


aagatgacca 


tgagtggtgt 


gggaaaatgc 


agacaggtaa 


2460 


attagttgaa 


acaactggcc 


tgcttgaat t 


attgtgtcta 


taaattttga 


caccaccttt 


2520 


tgtttcaggt 


tggtatctct 


gaaagtggga 


tagatgaggc 


agtgcaaaag 


ccattagctc 


2580 


gtgggtaagg 


gaaccaccat 


actttatgga 


attegtctae 


tgccacttcg 


gctaggattc 


2640 


aagaaacgga 


aatcacctca 


agcatcatta 


gtcaggaccc 


tgagaactca 


ggatgtttcc 


2700 


ttattcgtta 


tataataagt 


cttttcatca 


aggagtaaca 


aacaaaactt 


gcacaatatt 


2760 


tgtgtgctca 


ctggcaaggc 


atatataccc 


agctaacctt 


tgccagttca 


ctgtagtaac 


2820 


agttacggat 


aatatatgtt 


tacttgtatg 


tggtaccctc 


attttgtctc 


tcatggaggc 


2880 


tctcaagcct 


tgcgttgaaa 


ctggacagtt 


acatatgett 


ccaacagaaa 


etagcatgea 


2940 


gattcatatg 


ctttcctatt 


ctactaacta 


tgtattgaca 


cactcgttgt 


ttcttttgaa 


3000 


agatataaag 


ttggacgaat 


cgagcagcta 


gaaacatctg 


accaagcaaa 


agecagaggt 


3060 


gctaatactg 


taagtttcct 


tggataggtc 


aaggagagtg 


ttgcagactg 


ttcctgatca 


3120 


tttctttttc 


tgtacattac 


fettcatgetg 


taatcaactc 


aatggctatt 


ctggtctgat 


3180 


tatcagataa 


ttccaaggaa 


gctagttcag 


gtattaactc 


caccaacagc 


aagcgaggga 


3240 


aacatcgggc 


ctgatgccgt 


ccatcctctt 


gctataaaag 


aggtttgtta 


txtacttatt 


3300 
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tatcctatca tgttcagttc atccaagtcc tgaaaaatta cacccttctt taccaatctt 3360 

ccatcaagct gtgtaaagga tttggaatta gaaaatcatc atttgatgcc ttgttttata 3420 

tgcaagaggt tcccttgaaa agatctgttt aagatccttt gcacttgaaa aattcaatct 3480 

ttttaagtga atcccctact ttcttacaat gatcatagtc tgcaattgca cgtcaagtaa 3540 

tatcattcct tgttactgca tccccctctt tcttaatgac cattgtctat gttgtgtttg 3600 

tctcgtgtgc tggagaaaat gatagctgat ccaagctgta cattatcatg attaagtagc 3660 

tgctcaggaa ttgcccttgg ttacattgcc taatggtttg atgtcaattt ttcttctgaa 3720 

tctttacttt agatcaaaat ggagctacaa aagtgttcaa ctgtgtatgg atttgctttt 3780 

gttgactgtg ctgccttgag gttttgggtt gggtccatca gcgatgatgc atcatgtgct 3 840 

gctctcggag cgttattgac gcaggcaagc aagtgtatcc tgtatcttat gtgtaccacg 3 900 

tgacttcctg tgcatatacc tgggtt:gcag gaactaattc tgaatcacca tttggtatgt 3960 

ttttcccagg cttccccaaa ggaagcgcca tacgacagca aaggcaaact gcttgtatcg 4020 

ccagttgttt tgttaaacag aatttaaggt aaatgacact ggctaattta aagtgcatac 4080 

atgttgaaat attgcagggc taccaagaga agcacaaaag gccctaagga aatatacgtt 4140 

gacaggtacc atttcagtag gcaagctaac tgacaattca accgctcacc gaatgatagg 42 00 

tctcttaaac actgctaatg tagatgatgt ttatgtttca atctaatagg gtctacggcg 42 60 

gtacagccgg ctccagtacc acaagtaacg ggggatacag atgccgctgg agttagaaat 43 2 0 

ataatagaat ctaacggata ctttaaaggt tcttctgaat catggaactg tgctgttgat 43 80 

ggtccaaatg aatgtgatgt tgcccctagt gctcttggag agctaattaa tcatctgtct 4440 

aggctaaagg tgtgttggct tgtttagttt ttgcttttca caaattaagc aaaggaactt 45 00 

ttcataactt acagtttcta tctacttgca gctagaagat gtacttaagc atggggatat 4560 

ttttccacac caagtttaca ggggttgtct cagaattgat ggccagacga tggtaaatct 4620 

tgagatattt aacaatagct gtgatggtgg tccttcaggc aagtgcatat ttcttttttg 4680 

ataacttcaa ctagagggca gacatagaag gaaaaattct aatacttcgt acggatctcc 474 0 

agtaagtaat agccgatttt fcgtttaccta tgtagggacc ttgtacaaat atcttgataa 4800 

ctgtgctagt ccaactggta agcgactctt aaggaattgg atctgccatc cactcaaaga 4860 

tgtagaaagc atcaataaac ggcttgatgt agttgaagaa ttcacggcaa actcagaaag 4920 
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tatgcaaatc actggccagt atctccacaa acttccagac ctagaaagac tgctcggacg 4980 

catcaagtct agcgttcgat catcagcctc tgtgttgcct gctcttctzgg ggaaaaaagc 5040 

gccgaaacaa cgagcaagta tcaatcacaa gttttctgag taacgccctc cacgagtagt 5100 

ataggactaa aacattacgg gtctagctaa agactgttct ccttcttttg caatgtctgg 5160 

ttattcatta catttctctt aacttattgc attgcaggtt aaagcatttg ggcaaattgt 5220 

gaaagggttc agaagtggaa ttgatctgtt gttggctcta cagaaggaat caaatatgat 5280 

gagtttgctt tataaactct gtaaacttcc tatattagta ggaaaaagcg ggctagagtt 5340 

atttctttct caattcgaag cagccataga tagcgacttt ccaaattatc aggtgcccat 54 00 

ctatctttca tactttacaa caaaatgtct gtcactactc aaagcaatgc atatggctta 5460 

gatctcaact cacaccccga ggatcctaaa gggatttgct ttttattcct aatgtttttg 5520 

gatggtttga cttatttcta acttgaactt attaatcttg taccagaacc aagatgtgac 5580 

agatgaaaac gctgaaactc ccacaatact tatcgaactt tttatcgaaa gagcaactca 5640 

atggtctgag gtcattcaca ccataagctg cctagatgtc ctgagatctt ttgcaatcgc 5700 

agcaagtctc tctgctggaa gcatggccag gcctgttatt tttcccgaat cagaagctac 5760 

agatcagaat cagaaaacaa aagggccaat acttaaaatc caaggactat ggcatccatt 582 0 

tgcagttgca gccgatggtc aattgcctgt tccgaatgat atactccttg gcgaggctag 5880 

aagaagcagt ggcagcattc atcctcggtc attgttactg acgggaccaa acatgggcgg 5 94 0 

aaaatcaact cttcttcgtg caacatgtct ggccgttatc tttgcccaag tttgtatact 6000 

cgttagataa ttactctatt ctttgcaatc agttcttcaa catgaataat aaattctgtt 606 0 

ttctgtctgc agcttggctg ctacgtgccg tgtgagtctt gcgaaatctc cctcgtggat 612 0 

actatcttca caaggcttgg cgcatctgat agaatcatga caggagagag taagtcttgt 618 0 

tctcaaaata ccaattcctc gaactattta ctcagatttt gtctgattgg acaaggtggt 624 0 

Cttgcttttt tttaggtacc tttttggtag aatgcactga gacagcgtca gttcttcaga 630 0 

atgcaactca ggattcacta gtaatccttg acgaactggg cagaggaact agtactttcg 636 0 

acggacacgc cattgcacac Ccggtaacct gctcttctcc ttcaacttat acttgttgat 642 0 

caacaaaaac atgcaattca ttttgctgaa acttattgat ttatatcagg tttttcgtca 6480 

cctggtagag aaagttcaat gtcggatgcc ctttgcaaca cactaccacc ctctcaccaa 654 0 
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ggaattcgcg tctcacccac gcgtcacctc gaaacacatg gcttgcgcat ccaaatcaag 6600 

acccgactac caaccacgtg gtcgtgatca agacctagtg ctcccgcacc gtctaaccga 6660 

gggagcttgt cctgagagct acggacttca agtggcactc atggctggaa taccaaacca 6720 

agtggttgaa acagcatcag gtgctgctca agccatgaag agatcaattg gggaaaactt 6780 

caagtzcaagt gagctaagat ctgagttctc aagtctgcat gaagactggc tcaagtcatt 6840 

ggtgggtatt tctcgagtcg cccacaacaa tgcccccatt ggcgaagatg actacgacac 6900 

tttgtcttgc ttatggcatg agatcaaatc ctcttactgt gttcccaaat aaatggctat 6960 

gacataacac tatctgaagc tcgttaagtc ttttgcttct ccgatgttta ttcctcttaa 7020 

aaaatgctta tatatcaaaa aattgtttcc tcgattataa caagattata tatgtatctg 7080 

tcggcttagc tatggtatat aatatatgta tgttcatgag attggccaag agaaatactc 7140 

acaaacagta tattaagaag gaaacatgtt tacgcatcaa tttaagtttc aagacaaact 7200 

gcaaataacc tcgaccaaag ttgcaaagac caaacacaaa ttacaaaact tataagacct 7260 

aagttctgaa ctccctaaaa ccaaaaaaaa aaacagaaca tattttgttg cacctacaaa 7320 

caacacaaac ctacatagtt tataacttac tcatcactga gactaacatc agaatcattc 73 80 

tccatttctt catcttcact ctcatcacca tcaccaccac catgacgatt ctcctcctcc 7440 

tcacgtaacc tagcaatctc actctgagct ctatcaacaa tctgcttctt ctgcaactcc 7500 

aaatctctct gaaaatcagc tctcatcctc tccaactcct ccacttgctc cttcctactc 7560 

ttctccatct tctcataaac cttcccaaac ctctcaacag aatccgccaa catcttatac 7620 

gaagcagcgt cattaacctt cttcctctcg tactcaacct catcatcctc atcctcctcc 7680 

tcttcagaat caccaggact atccaccatc tcatcaaacc cactagactt atctaaataa 7740 

accttagtgt tcataaacac aaactcacct gaatcaacac cacaagctaa acctaaatcc 7800 

gacttgggcg aaacacaaag caacatatcc aacttattga aaaacgacca tttacttgaa 7860 

cctaaacctg atttctcaac cttaatcttc tcttttctat acttcctctt caagtcatca 7920 

atcattctcc tzacattgcgt ctcagatttc tccatcccta gctcctcact cactttctca 7980 

gctacttcat tccaatcctc ^ttcctcaaa ctccttctac ccaatcgcaa aaacctatct 8040 

ccccaaactt caagcaacac aa 8062 
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Figure 2 
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TPXTTTGGTTGCTAfcCJWTAMflCTATACCGTTTTATGTCft.TCAA.TOTAR 

CTA.t^TAAAA6W» , fGJ^»TATATATTGfTTtTT»TITAtCW*C 

AA/^CAACAACACTTTTTtTTTAC7TTTTACA7TG(5TCWCAftAATJVCflA 

CATJ«ACGACATCC7TWATCATTTCCC»TTtTACCCCTAAGITTttCA 

rCTA<3MCCTTCrrCCATCTTCCCTJiCCACA«CCTCArTAGQAA^ l 3C'ITT 

ACCAITCTCATATTCCr6A*CTACCT(SAfiTCCTCTCATTBATCT6TTTCfi 

CCAAATCCGCTTGTGACATCTTCTTCTCCAATCtCOCTTTCTGTATCATC 

AACCTCACCTCTCC7TyCACACGATCCATCGCCGC:AGGC7<:TCTTTCT7C 

TTCCA-SCTTCTTDETCTTAATCACCCGAACCGCCGTA^rTTCCCCTTTT 

TGTTCGAACC-MCATCI^ATTTCTTAACCGTT'T^AACCGCtJACACCGTTT 

CTCADA&CTGCdlTAACCGCTTTCGGAtCGCGTAGGTCTTGGCTCTTTTG 

TTrrGMTTGWGAGAACTACTWTTCCCABTCTTCretTACTBCtCCTG 

GGTATCTGCTCGGCATCGTCCATGAATTGPiGAGAAAGGAACWWZGCCAAA 

ATTTTATTMTCtGAGTTTTGA^TTGACAWCGATG^ATGAAGAATG 

TTGTTGAG.tlGGATTBT<:A7A-J'F7A7AfATACGAAGATTGGrT J ?CTGGA«3A 

^TCCA-TCATCrrJTTCTCCATTTTCCTCTCTGGAACGTrCTTAGAGATG 

ATTGACGACGTGTCATTATC7GAT?TGCAGa'TAACCAATG[rrrrj7QGGT 

TGGAT-TCGTGGTACACC^TATTATCCGATTTGGCTCAATGGTTTTATATA 

.V^TTTGGTT~7CGGTTCGGTTATGAr.TTA7CATTAAAATTA?,&CtAACCA 

AAAATTTTCGTAA^.TTTATTTCCGTTrCAATTCGGATCD^TACTTtCA 

CAACCOAATl^TTCGAAACCGGGGT IA "CCtAACCGRftTACCPJIT GCCTG 

ATTl^CTCGTTGGCTAGAAAGA-rCCAACGCTATACAATAATAGAACATAA 

ATCGGACGGTCA7CAAAGCCTCAAAGAGTGAACAGTCAACAAWAAAGTT 

GAGCCCTGAGGAGTA7CGTTTCCGCCA7TTCTACGACCCAAG«CGAAAAT 

TTTTGGCGCCAATCTTTCCCCCCTTTCGAATTCTCTCAGCtCAA.^CATC 

GTTTCTCTCTCACTCT^C^CACAATTCCAAAAAATGCAGCGCCAfiAGAT' 

CGATTTTGTCTrrCTTCCAAAAACCCACGGCGGCGACT?^GAAGGGTtTG 

GTTTCCGGCGATCCTGCTAGCGCCGGGCCCGGCAGCGa^GACCACGATTT 

AATGTGAAGGAACGGGATGCTAAAGGC5ACGCTTCTGTACGTTTTCCTG7 

TTCGAAATCTGTCGA7SAGG77AGAGGAACGGATACTCCACCGGAGAAGG 

TTCCGCGTCGTGTCCTGCCGTCTGGArtTAAGCCGGCTGAATCCGCCGC7 

GATGCTTCGTCCCTGTTCTCCAATA1TAT<?CATAACT7TGTAAAAGTCGA 

7Wl<WA^TTGrrCrGGAGAGAGGTACTAAfCTTCCATTC7CTTAA7TT 

TGTTATCTrTAGCTGGAA&AAGAAGATTCGTGTAATTTCTTGTATTCGTT 

GGA<»eATTCTGArJACTGCAT7GGATCGTTG77TACAAArrrrCAGGAC 

CCGAGAAGATGTTGTTCCGCTGAATGATTCATCTCTATGrAtGAAGGCTA 

AKiATGTTATTCCTCAA7TTO&7?CCMTAA7SGtAAAACTCAAGAAAGA 

AACCATGCTnTAGTTTCAGTGGGAGAGCTGAACTTAGATCAGTAGAAGA 

TA^AGGAGTAGATGGOGA7£TTCCTGGTCCA6AAACACCAGGGATGCGtC 

CACGTGCTTCTCI5CTTGAAGCGAGTTCTGGAGGATGAAATGACTTT7AAG 

GAGI^TAA^GTTCCTG7A7^GGACTCTAACAAAAX3GCTGPAAATGCTCCA 

«5ATCCG6TTTGMGAGAGAJWAAAGAAGTAAAC<^GAA0CAAA7TIC 

AATGGCTTGAGTCTTCT03AATCAGGGA7eCCAATAGAAGACGTCCTGAT 

GATCOCCTTTACGATJ^AAAGACCTTACACKTACCACCTEATGTITTCAA 
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fGGACATTXilSCTTTTCTTTWU^TGfiTTASl'WCIATTAATCTACTGTT 
Ci^7C^ATTTCCTCAATGTGArTTCTTCACT7j\CATCTG7TTACGT7ATG 
CTCTTCTCM6GGA»TTmTGAGCTGTATGATCT^TGW<3AAtTAG 
GTCACAAGGAGCTTGACTCG»GATWCCATGAGTCGTGTGGG?IAAATGC 
AGACAGGTA.WTTAC?Tr>AAAC^ACTGGCCTGCTTGM7TA7TGT5TCTA 
TAAATTrTGACAGCACCTTTtGTTTCAGGTTCGTATCTCTGAAAGTGGGA 
TAGAT^GGCA&TGCAi^CCT^TTAGCTCCTGGGTMG^AACCATCftT 
ACTTTATGGAMTCGTTtACTGCT , P.CTTCGGCTA^AT7TAAGAAATGGA 
AATCaCTTC^GCATCATT^CT^AG^ATCCrrGAGAACTCAGGAtGTTTTC 
TTA7TCGTTATATAA1AAGTCTTTTCATCAAGGA5T*ACAAAiCAAA>CTT 
E^CAftTAT7TGTGTGCTI^CTGGCAAGGCATATATACCCAGCTAACC'T J T 
TGCTAGTTCACT^7AGT.CiACAGTTACGGATAATATATG7T7ACTt^TAXG 
TGCTACCCTCATTTTGT^TCTCATGGAGGCTTTCAAQCCTTGTGTTGAAA 
CTGGATA£TTACATATGCTTCCAACAGAAA£TAGCATGCaGATTCATATG 
CTTTrCTATTCTACTAATTATGTAiTCACACACTCGTTSTTTCTTTTCAA 
AGATA7AAAGTTGGACCAA7CCAGCAGCTAGAAACATCTGACCAAGCAAA 
AGCCA0AGDTGC-^TACrCTAAGTT7TeTTGCA7AG<i?CAA^GA&AGTC 
TTGCAGACTC7777TG ATCA777C7T7T7CTGTACft7TftCTTTC.ATGCTG■ 
TAATT AACTCAATCCCT ATT C-TGGT C7 G A^T ATCAG AT AA7TeC AAGGAA 
GCTA^rrTCAGGTATTAACTCCATC^CAGCAAGOSACGfiAAACPfcTCGCGC 
CTOATGCCG7CCATCTTCTTGCTA7AAAAGAGGTTT3TTATTTACT7ATT 
TATCTTATCATCTTC^TTCATCCAAGTCCTGAA^rTACACT^TTCTT 
TACCAATCTTeCATCAAGCTGTGTAAAGGATTTGGAAT7AGAAAATCATT 
A7TTGATCC7TTGTT7TATATer7LAGAG^tTCCCTTGAAAAGATCTETTT 
AAG A7TCTTTGCACTTG AAAAATTCPATCTTTTT AAG7 CAATC CCCTACT 
TTCTTACAATGATCATAa7CTGCAA7TGCATGTCAAGTAftTATCAT7CCT 
7GT7 AC7GCATCCCeC7CTnCTTAATGACC ATTGTCT A7 GTTG7 GTTTC 
7CTCCTGTGCTG<^AAAATGATAGCTGATCCAAGCTC7ACATTATCATG 
ATTAAGTAGCTGCTCA&5AATTGCCTT7GGTTACATTGCCTAATGG77TG 
ATGTCAA7T7TTCTTCTGAAteT7TATTTtAGATCAAAftT(^A<^7ACAA 
AAGTGTTCAACTGTG7ATGGATTTGCTTTTGTTeACtG7CCTeCCT7GAG 
G7TTTGGG7TGGCTCCATCAGCGAl^A7GCATCATGTCC7GCTCTtGGAG 
CGTTA7TGAT<>CAGGTAAGCAA^TGTATTCrrGTA7C7TATGTGtACC?lTG 
TGACTTCCTGTGCATA7ATTTGGGTTGCAGGAACTAA7TCTGAATCAOCA 
TTTGGTATGTTTTTTCCAGWTTCTCCAAAGGAAGTGTTATATGACAGTA 
AAGGTAAACTGCT«TATCGCqAGTTGTTTTGT7AAACAGAATTTWW5GT 
AAATGACACTGGTTAATTTAAAGTGCATACATGTTGAAAYATTGCAGGGC 
TATCAAGAGAAGCACAAAAGGCTCTAAGGAAATATAC&7TGACAGGTACC 
ATT^CAGTAGGCAAGCTAACTGACAATTTAACCGCTCACeGAATGATAGG 
TrtCTTAAACATTGCTAAT^TAGATGATGTTTATGTTTCAATCTAATAGG 
GTCTACGGCGGTACAGTTGGCtCCAGTACCACAA<5TAA7GGGGGATACAG 
ATCCTGCTGGAGTTAGAAA7ATAATA^AATCTAACGGATAeT7TAAAOTT 
TCTTC7C^7CATGGAACTGT^TGTTGATGGTCTAAA7GAA7eTGATGT 

Figure 11 {CantAnwd) 
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7GCCCTT AGTGCtCTTCG AGftDCT AATT J^T^TCTGTCTAfiCCTAAACS * * 50 

t roTTGOTTGTTTAGTrtTTGCrm 4 500 

TTC A fAACT X AGAGT 7 TCT A TCT AC XT GC AG CT AG AAG At G7AGT X AAGC * S 5 0 
A7GGGGATATTTTTCCATACCAAGTTTACACGQ^TTGTC7CAGAATT^AT 
GGCCaMCGRTGGTJWVTCTW^AraTTTAKAATAGCTBTGAtGGTGG 

7CCTTC AGGCAftGTGCATATTTCTT TTTTGATAACTTCAACf AG AGQ5CA < 7 ^0 
G ACArAiSWGG AAAAAtTCTAAT ACTTCGT ACGGATCTCCMGT AAGTAAT & Q 

AGCCGATTTTT GTTTACCTATGTAGGG ACCTTCTACAAAT ATCTT GATAA * 3 $° 

CTGTGTT AGTCCAACTGGTAAGCGACTCTT AAGGAATTGGATCTGCC ATC * * 5 0 

C ACTCAAAG ATGTAGAAAGCATC AAT AAACG«TTGAX€T AGT7G AAGAA * 

TTC WGGC AAACTC AG AAAC TWGCAAATCACTGGCC AGTATCf CC AC AA * 9 5 0 

ACTTCCAG AGTT AGAAAG ACTGCTCGG ACGCATC AAG7CTRGGGTTCG AT * 0 00 
MTCAGCCTCTGTGTTGCKGCTCT7CT&GGGMAAAAGTCCTGAAACAA 
CGAC7AAGTATCIWKAGAAGTTTTCTGAGTAATGCCTrcCA«AGTA«T 

AX AGGACT AAAftC ATTACGGGTCTACCT AAAGACTCTTC7CCTTC77TTG * 1 50 

CRATGTCTGGTTAT7CATTACATTTCTCTTAACTTA7TGCATTGCAGG7T ^ 2fl0 
AAAGGAf TTGGGCAA ATTGTG AAAGGG77 CAGAAGTGGAATTGATC-7 GTT 

G77GGCTC TAG AC AAGG AATC AAAT ATG ATG AGTTTGCT77AT W ACTCT 5300 

GTAAAC7TC'JT*.TATTW?7AGGWiAAAGCG«l5C7AGAG7 ~ATT7 CTTTCT * 1S0 
CAA7TCGAAOCAGCCA7RGA7AGCGAC.T77CCAAATTATCAEGTCCCCA7 

C7ATC7TTC AT ACTTj ACAACAAA ATG7C7G7 C AC7AC7 C AM&CAA7 
ATATGGCTTAGAXCTCAACTCACACCC&GAGGATCCTAAMGGATTXGCT 
77TT ATT cctaATOTTTTTBGATGGTTTG ATTTATTTC7 AACTTG AACTT 5 * 5 0 

ATTAATCtTGTACCAGAKCAAMTGTGACAGATGAAAACGCTGAAACTC 

TCAGAAT ACTT ATCGAACTTTTT ATCGAAAG AGC AACTC.WXCGTC7G AS 5650 
GTCA7 TC AC ACCATAAGCTGCGTAGATG7CCTC ACATCf 7TTGCAA7 CCC 5700 
AGCAAGTCTCTCTGCTGEAAGCAXDGtCAGGCCTGTTATTTTTCCCGAAT 
CAGAACCIACAGATCAGAATCTAGAAAACAAAAGGGCCAArACTXAAAATC 
C^GGACXAT«<}CA7CCA777GCAG7TECAGCCGA7GG7CAA77GCCTGT 
7CCGAAT«lTATACrCCTTeGCGAGGCfAGAAGAAGCAGTGGCA«CA7TC 

A7CCTCGGTCA7l^TTACTGJsCBGGACC AAACA7GGGCGGAftAATCAACT * 93 0 

C7TCTTCGTGCAAC ATGTCTGGCCGXXATC7TtGCCCAAGTT7^T AT ACT 
KTTAGA7AATTAC7CTATXC^TGCKA7C^G77CT7C^CATGAATAAT 
AAATTCTGT77TCTGTC7GCAGCTTCGC7GCTACG7GCCCTG7GAGTCTT 
GCGAAA-rCTCCCTOGTGGAtACTATCTtCACAMGCTTGGCGCATClQAT 
AGAA7CATGACAtGA6AGAGXAAGTT77GTTCTC^tAAATACC^ATTCCXC 
GAAC7 , ATT7AC7CAGAXmGTQTGATTGGACAAGGTGCiTTTTCC77TTT 
77TAGGTACCXTTTTGGTAGAATGCACTGAGACAGCGTCAGTTCTXCAGA 
ATGCAACTCA^^XTCACTAPXAAXCCT7GACGAAC7GCGCAGAGGAAC7 
AGTACTTTCGATGGiATACGKATTGCATACTCG^yTAACCTGCTCTTCTCC 
XTCAACTTATACTXGmATCAACAAAAACAXGCAAtTCATXrtGCTGAA 
ACTTAT7GATT7ATATCAGGrrTXTCGTCACCTGGTAGAGAAAGTTCAAT 
GTOG<^TGCTC77TGCAACAC^T7ACCACCCTCTCACCAA(MAAXXCGC^ 
XCTCACC0ACG7GTCACCTCGAAACACATGGC7tGCGCATXCAAATCAAG 
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a TP'frt a 1* viitp a a r^t i H r^ , Pr , ^T*TnTr2 ft.Tr 1 a H£^iwt*t a nT^TTPTT^.T q^p 

fti u/Ll s r\ J W*V\H»**" L*bX J J W A 4 ^Anvtl^'w 1 I^U 1 U 1 1L1 lul AL^Lr 




ftTTT a iir,pr ti^^A.r.pTTm' cctc ACAfifT APftn kP'TTC iARTf;cnir''pr 

La 1 X 1 nnWvunuuunuL 1 L \J J A uauriuu x nu^unu x x urtrtu 1 u\>\rxW* X Li 


6700 


ATfiftfTfi Hi^TXXf'PlliACCAACTCfiTT^AA D fir ATCAG^TGCTGCTPi 

r% l uuv X VJ wnn L AV/L«/vVlv u V'U *■ uu* J x \i/vv*^nu\rn X ^» nv -J 4 Jv J WW X *w-n 




W<n 4 »3<V%v AU/\ 1 iW^Xt X X uuUunnnnW 4 4 VfW^iJ X wrVPvHP i vAvv W~l x 


6800 


V l vJ»s> L 1 V X VAny X v X u<_>r\X 13 AA*Jr»iv b *-? '-7 — ■ x vrvnv X <-r-r» 1 1 Vr\j x utpvjr X-o- A X 


6B50 


Tr^TPf!AnTPftPP , r'APB ArA&TGrrr(V^TTr^r^AA^ATrtrTTli?^ir_XLf r 


u ^ v ** 


kXxuixxj uVj x x \JUVrn4 VJ» l P*^ i Uinnn x u-§w x x x ct\+ x vi u x x krx\nnX 


6950 


VtLJLi X rl X UnViJi 1 nAV/*M X 1 \3njnu'u jt 1 X nrV»3 J \* X X X X 1 i 


7000 


TTR ATnTTTft.TTPfTP'T'Tli A AAA fiTfSP TTIiTllTB TflL&LA ft S.^TP.T'T'l'PiP 
\* 1 Vn JVl 4 lAl 1 V 1 V * inftnrWt X X XnXnXnl vn/l/UVin i 4 v 4 X X v<-" 


7050 

f W tJ *J 


4 ^un x x n x Jv\.s_t£-Lri\^n x ml fllvl rS4 v J u x t-iLju? x x x/uiLi xrt x Liu ml J-L x 


' Ivv 


HATA'PElTR'To Tr'T-TflTRanftTTrrTCAft fiA^A sts t snvAFAAACAfiTiv 

•rV-L* ftlnlUljilul X Li/i X uA'on J J uu X wlAu/lUrlnn X runLnrvisrA^ Xf\ 




T B TTl B f2 B inn nn Tl^IlTfiT^TlYTP.PS'r'F'll BT-T'PZl B ft TT^T'UIin Q'TIL A a PT 
1 H X 1 AAuA/ViJUAAA XAX *J i J 1-n X ULtA 1 £ f\Ai 1 i./\r J *U 1 X ji^»JVVjM J AMW X 


7"?0fl 


vj^-rVvri x jw4L> v L w-'-jrtL- 4 jh/i/ilj i XLsLtAAAVJA^rVrtrkA^nwJvtrix ihv'Uw^J' 




tat* .a art apT*r ZLjuffpr^rii attpp'PT a rA.A£*"p-A a\A A a a 11 a a l r"t(Tivii^i 


unci 


1A1 2 1 1 1_? 1 TutpHx Xj- 1 IrArti H. 1 * /iiT.'i^/i L^Hkrt/l Lr>_« 1 t*i\s£i 1 #iu 1 1 Mi 1 1 Hw 


J J7Lr 


rnf a mpTL ptp* CT. rp n* 7. Tir^riT^B/*"L L TT L T TTT rr JlTTTPTTrL TPTTr a r»T 

1 LA i Lrtt, L wftvA 1 1 AMU x 1 I'^l J 1 l v x 1 L- tt X v J i vnU> X 


r iuu 


V XV/i 1 LAi U.H X LAL U. AjLAjA^'^AX wrt J 1 1L1 Lj-Li X Lj-U XA^l X LAUu X AALiLp 


7^^0 


l AOLftAXL-X^ALi J UX(jA\>^ X U X KX LtAALvrt/i J L« XlrLr X X WJ JV <vLJm^vXU-L. 
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& iTr^nrr a t pT-'ni t h**^ & & cic bj^p rtp itt a.n^rTfrTT(VTrTrr 

fWfci VvvvW/v\vn 4 L- -i *X rt.V_' l kj> J J /i Lfc U V * wfl 4 4 ^u^w^* X X L. X X X Li X tu 




TAPTPAIlP^THBT Pa TrrTTL TPPTP > PTP<^TPT v Tr*llPba1 , PILPranrt2iPT 
X/\L« X LnnUu X L>n.l Ln 1 LiLi 1 Vr/i X L»U 4 V— X L'^- iLl 1 L^lLarvtl LHLVjnUUrv* 1 
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Figure 12 



Caminen»e/Ref«r«n«**: AIMSH3 3' side antennae : AtMSH3 V (13 = 2l04bp) from 
pUCtsm Salirert1/T4 Into pCWl84 BBmhK/T4 In AgrcbBderlum LBA4404 
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Cai»m«nt»/Ref«renc4*: AtMSHC (S8) 3' side anlis*ns : $Z SalUSsl1/T4 (WSbp) 
into pCW164 B«mHl/T4 
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Figure 14 



CotnmtrilsfRetereTiD&a: AtWSHS a'/AtMSHa 3' antlSGniSB : AtMSKfi (SB) 3* side (62=l379bpl 
Bal1/6*M/T4 Into pPFn (pGWW MMSH3 [SB) 3' 9id» (t3«2l04> arKlseneySmat, in 
LBA44M 
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Figure 15 



ComtntnWReferoncea; AtMSKS 37A1MBK& 3' anJiaBna (D) : A1MSH3 [35} V side 
(t9*21Q4bp) SaM/Ssn/T4 inlo pPFl4 (AtMSKS (S-5) S'elde ($2»i379bp) H ntl*&n66 Into 
pCWfStySmat. (n LBA44CM 
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8210 EcoRl.Sael, Kpm 



Figure 16 



Commflntamiferenow: AMSH3 (SB) compl»ta, sonw orientation : pPF2B (334ibp) 
Eoiat Into pCW1G< Smal 
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Figure 17 
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Figure 18 
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Figure 19 
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1 

SEQUENCE LISTING 



<ilC-> Rhone- Paul enc Agro; Betsner r Andreas Stefan.: Doutriaux, 
Ma-tte-PBECsla,: Fre/seinet, Gsorgea ; Perez- , P&scu&l . 

<-i?o> Methods for c-bc-sdnlng plant varieties 

<13G> 39549SC 

<IS0> P03745 

cL51> 1997-10-10 

-:16G> 93 

<210> i. 

t2il> 23 

<212> DWA 

*:213> Artificial Rftqop.KC* 
<220> 

^221 > iriCKSi f iftd_hraKR 

-;223> I 

<220> 

*22l> modi £ i e-3 Joase 

-c252> 14 

-223 > 1 

<220> 

*221> modif ied_baae 

<222> 17 

<222> I 

<220> 

<223* DeganeratG oligonucleotides UPMU used to ieolata AtMSM3 and 

*30i> Hainan and Kolodner 

<r303> Genetics 

<303> 232 

<306n 963-973 

OP7> 

<400> i 

ctggatccac nggnccnaay ,^tg 23 



*210:> 2 
*211> 23 
*2l2> DNA 



WO 99/1949* 



-.21 3 > Artificial sequence 

v2il> modi f ied_base 

<:22H> I 

<220> 

-:22i> nuxiif ied^base 

<222> IB 

<223> T 

<220> 

*223> Degenerate oi igpnucl Botitfee EOMU used to isolate AtMSHJ and 
AtMStf £ - 

<301> Reenan and Kolodner 

^502^ Generics 

^■303> 132 

<306> 963-973 

rtggatccrt artgngcnrc raa 23 



<2i0> 3 

<21i> 24 

^212> DMA 

^21 3 > Artificial sequence 

<22t>> 

<223> Msfci Bpacific primer 6 3fc £or PCR using cDNA oi Ar&bidopeLs 
t ha 11 ana ecotype Columbia 

<40Q> 3 



tgctsgtgcc tcttgca&gc teat 24 



<210> 4 
c2Ll> 21 
<2\2> DWA 

c2l5> Artificial sequence 

*220> 

<223> PriToer aP} for pcr ueing cDNA of Arabidopsls tha liana ecotype 

Columbia containing adapter sequences ligated bo both its 
ends 



4 



WO Ml 9492 



FCT/EP9WWOT7 



ccatcctaat aCQactcsct ataggo r. 27 

c2ll>> 5. 
<211> 23 
-:-2l2> OKA 

=;2I3> Artificial sequence 

<220> 

<223> Primer AP2 fox PC* using cDMA of Arabidopais thaliana ecatype 

Columbia, containing adapter sequences ligatsd to both its 
ends 

<4DD> 5 

actcactata ggcrctegagc ggc 23 



^211> 3D 
^212> DN* 

Artificial sequence 

<220> 

^222.> HSH3 specific primer 3525 Eor PCJ? using cDNA Ol* Arabidopsis 

thsliana ecotype Columbia 



aggttctgar tttcgtgtgac gctrsactta 



<210> 7 
<212> DttA 

<213> Artificial sequence 

<22G> 

<223> WBH3 specific primer 351 for PCR using cDidA of Arabidopsia 

ttialiana ecotype Columbia 



ggatDgggfca ctgggttttg ag^gtgagg 29 



<210> e 

c2ll> 24 

<212> SNA 

■c2L3> Artificial sequence 
<220> 

<223> sp&ciiic primer 63 5 for PCR using catfA of ArafcidapaiB 

th3 liana ecotype Columbia 



WO£9/1W92 



gcacgtgctt gatggtgttc tcac 24 



<H6> 


9 


<211> 


26 


<2l2> 


DNA 




Artificial sequence 


v220> 




<223> 


MSH3 specific primer S52 3 




thai i ana GCOtypa Columbia 


<400> 




tcagacagta 


tccagcatgg eagaagta 




1Q 


<211> 


33 


^2i2> 


DNA 




Artificial scout* nee 


<220> 




<225> 


tfSH3 specific primer 1SB : 




thaliana ecotype Columbia 


<400> 


10 


atcccgggat 


gggca^gcaa aagcagcaga cga 


*2lO> 


11 


<21i> 


21 


<212> 


£>NA 


<21*> 


Art i f i c ia 1 b eque ac e 


<22D> 




<223> 


MSH3 specific primar S53 




thaliana ecotype Columbia 






gacaaig&gc 


gaaatgaggc cccttgg 


<21Q> 




<211> 


1250 




DMA 


<213> 


Arabldopsis t ha liana ecot 


<223=> 


Clone 52 



26 



33 



27 



PCT/EP9S/0697? 



& 

cccgggacgg gcaagca&aa gcagcagacg acdtctegtt tcttcgrtcc caaacccasa 60 

tccccgactc aegaaccgaa tccggtagcc gaatcatqae eacegceaec gaagatatcc 120 

gccactgtat ccttctctcc ttccaagcgt aagcttctct ecgaccacct cgcrgccgcg I BO 

tcacceaana agcctaaact £tctcctcac aetc&aaacc cagtacccga tcccaa.ttta 24 0 

caccaaagat ctctcc&gag atttctgg&a ccctcgcegg aggaacatgt teccgaaaeg 3 00 

tcatcatcga ggaaatacac accsttggaa cagcaagtg^ tggagctaaa gagcaagtac 3 50 

ccagatgtgg tttCgatijgt ggaagttggt tacaggtaca gattcttegg agaagacgcg 420 

gagatcgcag cacgc-gtgtt gggtatttac gctcatatgg atcacaattt cstgacggcg 4fl0 

agtgtgccaa cntttcgatt gaftL'XU ccai gtgagaayftc tggtgaatgc aggatacaag S4 0 

atcggtgcacj c.^aagcagac tgaaactgca qccattaagt c:c:catggcgc aaaccggacc £0D 

ggcccttttt "c^ggggact gtcgy^gn. 3 tataccaaag ceacgcttga agcggctgag £60 

gatacaagrg gT.ggtcgtgg tggtgaagaa ggttttggtt cacagagtaa tttcctggtt 720 

t-gtgttgtgg atgagagagt taagtcggag acattaggct gtggtattga aatgagtttt 790 

gatgttagag tcggtgttgt tggcgttgaa stttcgacag gtgaagct.gr. tcatgaagag 64 0 

ttcaatga.ta acttcatgag sagtggatta gaggctgtga ttttgagctt. gtcaccagct 900 

gagctgttgc Dtggccagcc tct&fccacaa caaaccg^g^ agtttttggt ggeacatgct 9 60 

ggacctacct caaacgttcg agtggaacgt gccccactgg at r.gt ttcag caacggtaat L020 

gcagtagatg aggttatttc attatgtgaa aaaatcagcg caggtaactt agaagatgat 1080 

AaAtJaa.atga agctggagge tgctgaaaaa jgsatgtctt gcttgacagt tcatacaatt 114D 

atgaacatgc racatctgac tgttcaagcc ctcgrcctaa cgttttgcca tctcaaacag 12 DO 

bttggatttg aaaggatcct ttaccaaggg gcctcatttc gctctttgtc 12 50 



<210> 13 

<.2ll> 34 

<l212> WA 

<213> Artificial ae-qua^ea 
c220> 

<223> MSH3 specific pritter 255 for PCR using CDNA of Arahidopfiis 
thaliana ecotypa Columbia 
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atccr;ggytc aa^atgaaca agt&g^cttfc. agtc 3 4 

<2l0p H 

^2Ll> 27 

. <2V2> ONA 

<2ll> Artificial sequence 

<22$> 

<22 3> specific primer S52 for E>C£ using CE>NA. t>f Arabidopais 

cha Liana ecotype Columbia 

<4D0> 1-1 

gccacatctg actgttcaag ccctcgc 27 

<210> 15 

<21L> 211& 

i2--2> UNA 

*:2i:>> ^r<3bir?i>psi£ t'r.aizizna ecotyp<r Columbia 

*:223> Clone 13 

^300> ].r> 

gcc&catctg actgttcaag ccctcgccct sacgtcttgc catctcaa&c agtttggaer. 60 

tgaaaggatc ctttaccaag gggccxcatt tcgctctttg tcaagtaaca cagagaLgac 12 0 

ccc cucagcc aatactccgc aacag^t^ga ggttgtgaaa aacaar.tcag atggatcg^a, 160 

atetggc^cc ttattccata atatgaatca cacacttaca gzat ncggtt ccaggcttct 2410 

tagscoctgg ^cgskcxeatc ctctacgcga tagaaacttg atatctgctc ggcttgatgc 300 

tgtttctgag atttctgctt gcatgggatc tcatagttct tcccagecca gcagtgagtb 3€Q 

ggttgaagaa ggttetgaga gagcdabt^t atcacctgag tttfcadctcg tgctctcctc 4 20 

agtettgaca gccatgtcta gatcatccga ta&tcaacgt ggaataacaa gaatctttca 4 80 

Ccggactgct assagccacag agttcattgc agttatggaa gctattttac ttgcggggaa 54 0 

gcaaattcag cggcttggca taaagcaaga ctctgaaatg agga§tat:gc aatctgcaac 600 

tgtgcgatct actcttttge gaaaattgat ttctgtCatt tcatcccetg ttgtggtbga €60 

cAAtgccgga aaacttctct <$tg<:cctaaa taaggaagcg gctgttcgag gtgacttgct 720 

cgacatacta atcactteca gcgaccaatt tcctgagctc gctgaagctc gccaagcagt 7&0 

tttagtcatc agggaaa&gc tggattcctc gatagcctca t&tcgcaaga agcccgctat 340 
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t Lyd idl U 


g l c CC v C-C 


«a.gtgtcggg 


gatcacacat 


ttgatagagc 


tgcccgttga 


500 


L. <_ □. i3. y J L. 


,-1 p |- j 1— r, .3 tr K 


yQCjcgaact'^L 


aaatagcacc 


aagaagacta 


trzegatatea 




\- K -* '—■ V LrQ^j a 








gcBa ctqaar 


cLtgccac 


1020 


y Q n. H— y ck 


y - u «- ^ i_ yy^ 


m h h t k r*r« -t- 
•3. t_ c*.y LLLLLL 


Lnagagtcto 


agcagatact 


3 cacagat 1 1 




l -jicrsj y i_ y u^v* 


r YT - t~ -f^-i Arr ^-f r** 


I" t~ t't'f Y" n f* *i» .-"•■■p 


ggactgt ttg 


cact cccttt 


caactctatc 


1140 




C-51 li-'vr c 




cgtggat*gfAc 


tgcgc*accag 


ttgagatAaa 


1200 


o. l. a. k_ akj i_ i_ u 


yy l L7y t c ate 


CCy L actgy a 


gactatatta 


caagataact 


Ccgtcccaaa 




4 -H * .<« A A 1** K 


t tgcacgcag 


aaggg^aa ta 


ttgecaaatt 


atcaccggac 


Dtaacatggg 


132D 


^•Tr:a a s.*^ a n 


LLJ <_ U. id. L d L L L 


^ ^ ~» +■ 

Lj LCq a-y •_ c 


me sac vtcc 


^taatg^ctc 


a ?9tt-ggttc 


13&0 


k* 4* .4^' Li. a aj aj a 

Cl- l, l,3 t ft CO A 


gcgtcattc9 




tgtgcctgat 


ggtgttttca 


ctcggatggg 


1440 


t get tcsgac 


agtatccagc 


atggcagaag 


LBCCttLCta 


gaagaat f;aa 


gtgaagcgtc 


150D 


civs C« - a 3 1. -n 


acjaacctgr.t. 


ctt ct cgttc 


qcrt:g^c9t9 


ttagatgagc 


r.r.ggaagag^ 






u-acgaeggtg 


vageca ci*gc 




1 1 acagcatc 


tcccagcaga 


162Q 


aaac-agatg L 


ttggttctt t 


ttgccac^ca 


t Laccctgaa 


atagctgaga 


Lccgtaacgg 


1630 


St t CvCag^g^ 


cctgdtggga 


cat accatcj t 




acategcaga 


aggataaagg 


1740 


_a v a i* .t r% 

CagT-cacgaC 


catgatgaxg 


cgacc^accL 


ftbat&agccL 




ettgeagcag 


1300 


yayc Cl I gg t 


t ctaaggt 


-ctc-agcttgc 


GcagaCacct. 


ccatcatgta 


taogtcgagc 






getgeaaaar. 




g^cacgcgca 


agagagagaa 


atacacgcaJ; 


132& 


gg<fagaacca 


gaaggacatg 


^agaaccgas 


Aggcgcagaa 


gaatctattt 


cggCtCtagg 




tgarttgttt 


gcagacctga 


aatttgetet 


ctctgaagaof 


gaccctt^ga 


aagcattcga 




gtttttaaag 


catgcttgga 


agattgetgg 


caaaatcaga 


ctaaaaccaa 


cttgttcatt 


2100 


ttgacccggg 












211P 


«210> 
<211> 

<212> 


Lb 
29 
DMA 


1 











<213> Artificial s guence 



<220> 

<22 ^> MSK2 Epeciiic primer 3$1 fpr PCH UBing ctfMA of, Arabidop^is 

thai i Ana ecotyp« Columbia 



<400> 



16 
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■ggtttcggcjta ctgggttttg agtgtgagg 2 9 



<2lQ> 11 
<212> DNA 

<213> Artificial sfcffuence 

<22Q> 

<£23> specific primer S52 5 for PGR using CC-NA Of Arabidopaiq 

thaliaaa ecotype Columbia 

S^gttetgat tatgtgtgae gcctca^tta 30 

<iq.D> IS 

<2l2> DNA 

<2L3> AraJbicJopsii; ctotiittiHi ecotypc Columbia 
<220* 

^22iv CDS 

c222> (100) .... (1342) 

<=22 3> Atsosio fell -length ctfNA and deduced sGt^uence of zha encoded 
polypeptide 

<400> i8 

CC^aagasag cgcgcgaaaa ttggcaaccc asgttcgcca tagrcacgac cacgaectCC SO 

catttctctt aaacggag$a gattacgaat aaa^ca^ct 99 

atg ggc aag eaa aag cag csg acg ate tct cgt etc ttc get ccc aaa 14 7 

Wet Gly Ly b Gin Lya Gin Gin Thr He Ser Arg Pfce Phe Ala Pro l^ya 
15 10 15 

ccc aaa tec ecg act cac gaa ccg aat ccg gta gee gaa teat tea aca 195 
Pro Lys Ser Pro Thr His Glu Pro h&tx Pro v&l Aia Glu Ser ser Thr 
20 35 30 

ccg cca ccg aag ata tec gec Ace gta tec ttc tct ect tec a&g cgt 343 
Pro Pro Pro Lys Tie Ser Ala Thr Val Ser Phe Ser Pro Ser Lya Arg 
35 • 40 45 

aag ctt etc tec gac cac etc $cc gec gcg tea ccc aaa aag cct aaa 
Lye Leu Leu Ser Asp Hie Leu Ala AL& Ala Ser Pro Lys Lye pro Lya 

30 55 60 

v 

ctt tct cct cac act caa aac cca gta ccc gat ccc aac tta cac caa 339 
Leu Ser Pro Hia Thr Gin Asn Pro Val Pro Asp Pro Asn Leu His Qln 
$5 70 75 80 
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aga ttt ccr cag aga btc ctg gaa ccc ccg ccg gag gaa tat gtt ccc 397 
Ax^ Phe Leu r^a Arg Phe Lam Glu Pro S«?r Fro Glu Glu Tyr Val pro 

35 * $0 95, 



^aa acg r.ca tea teg agg aaa cac aca Cca ttg gas cag csa gtg gtg 
GlU Thr Ser Ser Sor Arg Lys Tyr Thr Pro Leu Glu Gin Gin Val Val 
l&U 105 no 



435 



gag eta aag age aag tac era gat gtg gtt ttg atg gtg gaa gtt ggt 
Glu Leu Lys Ser Lys Tyr Pro Asp Val Val Leu wet Val Glu vaL Gly * 
115 220 125 

tac agg tac aga ttc ttc gga gaa gac gcg ^ag ate gca gca cgc gtg 531 
Tyr Arg Tyr Arg Phe E>be Gly Glu Asp Ala Giu Tie Ala Ala Arg Val 
13* 135 14 D 

ttg ggc att tac get cat atg gat cac aat etc atg acg gcg agt gtg 57 9 

Leu Gly He Tyr Ala Hits Met Asp His Aeji t>he wet Thr Ala Ser V3I 
1^5 150 160 

cca aca ttt cga Ctg sac ttc ca^ gug aga aga ctg gtg aat gca gga 62? 
Pro Thr Phe Arg Leu Asn The His Val Arg Arg Ley Val Asn Ala Gly 

3.7i> 175 

tac aag ate ggt gta gcg aag cay act gaa act gca gec att aag tec 61S 
Tyr Lys lie Gly Val Val Lys Gin Thr Glu Thr Ala Ala lie bya Ser 
160 1B5 isg> 

cat ggt 3ca aac egg acc ggc CCt ttc ttc egg gga ctg teg gcg ttg 723- 
Bis* Gly Ala A&n Ars Thr Gly Pro Phe Phe Arg Gly Leu Ser Ala Leu 
135 2UD 205 

tat ace aaa, gec acg Ctt gaa gcg get gag gat s-a agt ggt ggt tgt 71 1 

Tyr Thr Lys Ala Thr Leu Glu Ala Ala Glu Asp T.l* ser Gly Gly Cys 
210 215 220 

ogt ggc gaa gaa ggt ttt ggt tea cag agt aat ttc ttg gtt tgt gtt 319 
Gly Gly Glu Glu Gly Phe Gly Bnt Gin Ser Aen Phe Leu Val Cya Val 
225 "0 235 240 

fftg gat gag aga gtt aag teg gag acs tea ggc tgt ggt att gaa atg 867 
Val Aap Glu Arg Val l*ys ser Glu Thr L*u Gly Cys Gly Jle Glu Met 
245 25Q 255 

agt ttt gat gtt aga gtcr ggt gtt gtt ggc gtt gaa att teg aca ggt 915 
Sar Ph* Asp Val Arg val Gly Val Val Gly Val Glu He Ser rhr Gly 
260 26$ 2-to 

gaa gtt gtt tat gaa gag ttc aat gat aat ttc atg aga agt gga tta 963 
Glu Val Val Tyr Glu Glu' Phe Abji Asp Asn PhB Wet Arg Ssr Gly Leu 
5?S ?BQ 2B5 
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gag get gtg att ttg age ttg tea cca get gag ctg tug ctt ggc cag 1011 
Clu Ala Val He Leu Sfcr Leu Ser Pro Ala Glu Leu Leu Leu Gly Gin 

CC1 cut tea caa ca* a or. gag aag ttt ttg gtg gca cat get gga cct 1QS9 
Pro l^ru Ser CJln £ln Thr GLu LyE Phe Leu Val Ala Met Ala Qly Pro 
305 31D 31S 320 

acc tea asc gtt cga gtg gaa cgt gec tea ctg gat tgt tte* age aat 1107 
Thr Ser Asn VaL Arg Val Glu Arg Ala Sor LfcU Asp Cys £he Ser Aan 
355 330 " 335 

ggt aat gca gta gat gag gtt att tea tta tgt gaa a&A ate age gca 1155 
Gly Asn Ala Val Asp Glu Val He Ser Leu Cy* Gl,u Lys lie Ser Ala 
340 335 350 

ggt aac tta g&a gsc gst aaa gaa atg aag Ctg gag get get gaa as a 12 03 

Gly Asn Leu Clu Aap Asp hys GLu Met Lys Loo Glu Ala Ala Glu Ly& 
555 360 365 

gga atg tct tgc ttg aca gtt eac ata att atg aac atg cca cat ctg 1251 
Gly Mec Ser CyB Leu Thr Val Hi^ Thr lie Met A^n Met Pro Hie Leu 
370 37S 3B0 

act gtt caa gee etc gec Ctft acg ttt tgc cat c:o aaa cag ttt gga 1299 
Thr Val Gin Ala i,eu Ala Liu Thr Pht> Cy£ Bis Leu Lyo Gin Plie <Uy 
3£S 350 395 400 

ttt gaa agg ate ctt tac caa ggg gec tea ttt cge r.rc ttg tea agt 134 7 

Phe Glu Arg zle Leu Tyr Gin Gly Ala Scr Ph<i Arg Ser L<eu Ser Ser 
405 410 415 

aac aca gag atg act etc tea gec &at act etc* C&a cag ttg gag gtt 13$ 5 

A$n Thr Glu Met Thr Leu Ser Ala Asn Thr Leu Gin Gin L&u Glu VaL 
420 425 430 

gtg aaa aat aat tea gst gga teg gaa tct ggc tec tta ttc eat aat 144 3 

Val Lys Asn Aan Ser Asp Gly Ser Glu Ser Gly Ser Leu Phe Hia Aen 
435 440 445 

atg aat cac aca ctt aca gta tat get tec agg ctt ctt aga cac tgg 14 S3. 
Wet Asn Kis Thr LftU Thr Val Tyr Gly ser Arg Ley Leu Arg Hi 6 Trp 
450 455 460 

gtg act cat cct eta tgc gat aga aat ttg ata tct get egg ctt gat l539 
Val Thr HiF Pro Leu Cye Asp Arg Aan Leu tie Ser Ala Arg Leu Aap 

470 475 4B0 



get gtt tct gag att tct, gec tgc atg gga tct cat agt tct tec cag 
Ala Val ser Glu Xle Ser Ala Cys Met Gly Ser Hie Ser Ser Ssr Gin 
4B5 495 



153 7 



WO 99/1949* 



PCT/EP98/06977 



11 

etc age age gag £tg get gaa gaa gqz tct gag aga gca att gtA tea 1635 

L*u Ser Ser G'lu Leu Vai Glu Glu Gly Ser Glu Ar^ Ala He Val 3er 

SOD 505 SID 

cet gag ttt tet CT-C gtg etc tec tea gtc ttg aea get atg tct aga 1633 

Pro Clu Phe Tyr Leu Vrtl Leu Ser Ser Val Leu T'hr Ala Wet Ser Arg 

Sl$ 520 B2S 

tea tct gat att caa cgt gga ata aca aga ate Ctt cat egg act get 1731 

Ser Ser Asp He Gin Arg Gly II* Thr Arg lie Phe Elifl Arg Thr Ala 

B30 ^40 

aaa gee aca gag etc att gea gtt atg gaa get att tta ctt gcg ggg 1779 

Lye Ala Thr GLu the lie Ala Val Wet Glu Ala lift LaU Leu Ala Gly 

550 555 560 

aag caa att cag egg ctt ggc ata aag C?aa gac tct gaa Atg Agg sgt 1827 

Lys Gin He Gin Arg liftU Gly He Lys. Gin Aap Ser Glu Met Arg Ser 
565 57* 575 

atg caa tct gca acc gtg eg a tut act ctt ttg aga aas ttg att tct 1B?S 

Mec Cln Ser Ala Thr val Arg Ser Thr Leu Leu Arg Lys Leu lie ser 

b&0 5^5 £9f> 

3tt atr. tea tec ccc g^.t -gtg gw. gac aut gec gga aaa ctt etc tct l£23 

Val lie Scr Ser Pro V.al Val Val Aap Asn Ala Gly Lya Leu Leu Ser 

5SS 600 ^05 

gee eta aat aag gaa geg get gtt cga g<yt gac ttg etc gac ata eta 1&71 

Ala Lieu ASii LyS Glu Ala Ala Val Arg Gly Asp Leu Leu Aep lie Leu 

610 615 §20 

ate act tec age gac caa ttc cot gag ctt get gaa get cgc caa gca 20i£ 

lis Thr Ser Ser Asp Gin Pbe Pro Glu Leu Ala Slu Ala Arg Gin Ala 
625 630 635 £40 

gtc fcfca gtc ate agg gaa aag ctg gat tec teg ata get tea ttt cgc 2057 

Val Leu Val lie Arg Lys Leu Asp Ser Ser Ila Ala 3er PtiB Arg 
645 65& £55 

aag aag etc get att cga aat ttg gaa ctt ctt caa gtg teg ggg ate 2115 

Lya Lya Leu Ala lie Arg Asn Leu Glu Phe Leu Gin Val S*r Gly He 

660 €65 £70 

aea cat ttg ata gag ctg ccg gtt gat tec aag gtc cct atg aat tgg 2i$3 

Thr Hi& Leu Tie Glu Lbu Pro Val Aap $er Lys Val Pro Kla Aan Trp 

«75 680 685 

gtg aaa gta aat age acc aag aag act ace cga tat cat ccc cca gaa 25 11 

Val Lys Val Asn Ser Ttir Lya Lya Thr He Arg Tyr His Pro Pro Glu 

690 5$5 700 
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ata gta get ggc Ctg gat gag eca get Cta gca act gaa cat Ctt gec 225? 

Ile Val Ala Gly Lau Asp Glu Leu Ala Leu Ala Thr Glu Kia Leu Ala 
7D5 710 • 7L& 720 



act gtg «*<ac ega get teg tgg gac agt ttc etc sag agt ttc agt aga 
Ile Val abji Arg Ala &er Trp Asp Ser phe Leu Lys Ser Phe Ser Arg 
725 730 735 



tac tac aca gat ttt aag get gec get caa got <rrt get gca cbg gae 23 5 5 
Tyr Tyr Thr Asp Phfc fcys Ala Al* Val Gin Ala Leu Ala Ala Leu Asp 
740 745 750 



tgt ttg cac tec ctt tea act eta tct aga aae aag aac tat gtc cgt 
Cys leu His Ser Leu Set Thr Leu Ser Arg Asn Lys Astl Tyr Val Arg 
"?55 760 7CS 



ccc gag ctt gtg gat gac tgt gaa cca gtt gag aba aac aba cag tct 2451 
Pro Glu Phe Val AGp Asp Cys Glu Pro Val Glu lie Abii lie Gin Ser 
7^0 77S 7B0 



ggt egr- cat oct gn.a r;r.g 999 ao\ aui ttca cae gac aac ttc gtc cca 24 9 9 
Gly Arg Hie i^ro Val Leu Glu Thr ile Leu Gin Asp nsn Phe Val Pro 
7SS 795 BOO 

aat gac aca acx trg cat gca 1399 g^a cat tgc can. att ate ace 254 7 

Asn Aep Thr Ile Leu Kis Ala Glu Gly Glu Tyr Cys Gin lie lie Thr 
B05 810 BIS 

gga cct aac atg gga gga aag age cgc tac ate cgt can gtt get tta 259 & 
01/ Pro Asn Met Gly Gly Ly& S^r Cys Tyr Ile Arg Gin Val Ala Leu 
S20 62S g 30 

att ccc ata atg get cag gtt ggc lcc Let gta cca gcg tea ttc gee 2643 
lie Ser lie Met Ala Gin Val Gly Ser Phe Val Pro Ala Ser Phe Ala 
&35 B40 345 



aag ctg cac gtg ctt gat ggt gtt ttc act egg atg ggt get tea gac 
Lys Leu His val Leu Asp Gly val Pha Thr Arg M*t Gly Ala Ser Asp 
B50 855 B«0 

agt ate cag cab ggc aga. agt acc ttt eta gaa gaa tta agt gaa gcg 2129 
Ser lie Gin flia Gly Arg Ser Thr Phe L&U Glu Glu Leu Ser Glu Ala 
3*5 870 675 SfcD 

tea cac ata ate aga acc cg't tct tct cgt teg ctt gtt ata tta gat 2787 
Sar Hia lie lie Arg Thr Cys Ser Ser Arg Ser Leu val lla L«su Asp 
BBS B90 9?5 



gag ctt gga aga ggc act, age aca cac gac ggt gta gee att gee tat 2B35 
Glu LftU Gly Arg Gly Thr! Ser Thr Kia Aap Gly Val Ala lie Ala Tyr 
300 ?05 910 
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gca aca Lta cag cat etc eta gca gaa aag aga cgt ttg gtt ctt tct 2BB3 

Ala Thr Leu Gin. His L«u Leu Ala Glu Lys Arg CyS Leu Val Leu Pbe 
91S 320 - 525 

gtc acg cat tac ect gaa eta get gag ate agt aac gga ttc cca ggt 2531 
Val Thr Kia Tyr Pro Glu lie Ala Glu lie Ser Asn Gly Phe Pro GLy 
9$0 935 940 

tct gtt ggg aca tad cat gtc teg tat ctg aca ttg cag aag gat aaa 2 97 9 

Ser Val Gly Tkr Tyr fcia Val £er ryr Leu Thr Leu Gld Lya Asp Lya 
945 350 955 960 

ggc agt tat gat cat gat gat gtg ace tae eta tat aag ctt gtg cgt 3057 

Gly £er Tyr Aap Hi& Asp Asp Val Thr Tyr Leu Tyr Ly& Leu Val Arg 
3*5 £75 

» 

ggt Ctt tgc age agg age ttc ggt ttt aag gtt get cag ctt gee cag 2075 

Oly Leu Cys Ser Aig Ser Phe Gly Phe Lys Val AU Gin Leu Ala Gin 
930 995 990 

&ea CCC CCft r,ca tgt ata cgt ega gec a^t tea acq get gca aaa ttg 5123 
Lie Pro Pro Ser Cys He Arg P,xq Ala Tie Set Met Ala Ala Ly £ Leu 
99S 1Q&0 1005 

gaa get gag gts ecjt gca aga gag aga as: aca cgc atg gga gaa cca 3171 
Glu Ala Glu Val Arg Ala Arg tJlu Arg AST) Thr Arg Met Gly Glu pro 
lOlO 2015 1020 

gas gga cat gaa gas ccg aga ggc gca gas gaa tct att teg get eta 3 219 
Glu Gly Ki£ Glu Glu Pro Arg Gly Ala Glu 5lu He Ser Ala Leu 

1G2S 1030 1035 1040 

ggt gac ttg ttt gca gac ctg aaa r.tt gcr. cr,c ten qaa gag gac cct 3267 
Gly Asp Leu Phe Aid Asp Leu Lys Ph6 Ala Leu Scr Glu Glu Aap Pro 
1D45 10S0 1055 

tgg aaa gca ttc gag ttt tta aag cat get tgg aag att get ggc aaa 3315 
Trp Lye Ala Phe Glu Phe Leu Lye Hie Ala Txp Lys lie Ala Gly Lya 
1060 1065 1070 

ate aga eta aaa cca act tgt tea ttt tgaxttaatc tcaacattat 3 362 

lie Arg JLeu Lye Pro Thr Cys Sar Pha 
1075 IQfiO 

ageaactgea aggtcttgat catfctgttag ttgegtacta scttatgtgt attagtataa 3422 

caagaaaaga. gaattagaga gatggattct aatccggtgt tgcagtacat cttttctcca 34 B2 

qccgcataaa aaaaaaaaaei aaaaaaaaaa aaaaaaaaaa 3522 



<2l0> 13 
<211> 10B1 
<212> PKT 
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'-2 23 j Ar&)>i<iop$is thai i ana ecotype Columbia 

<221> Polypeptide 

r-4D0> 1^ 

Met Gly Lys Gin Lys Girt Gin Thr lie Ser Arg f>he Phe Ala Pro Lys 
1 * 10 IB 

Pro Lys Se* Pro Thr Kia Glu Pro Abji Pro Val Al£ Glu Ser Ser Thi* 
20 25 20 

Pro Pro Pro Lys lie Ser Ala Thr V3l Ser Phe Ser Fro Ser Lys Arg 
35 40 45 

Lys Leu Leu Sat Asp His leu Ale Ala Ala Ser Pr*> Lys Ly& fro Lye 
SO 55 * 60 

Leu Ser Pro Kia Thr Gin A^n Pro VaJ. pro Asp Pro Asn Leu ftis Gin 
£5 70 75 B0 

Arg Ph<2 Leu Cln Arg Ph* Leu Glu Pro Ser Pro Glu Glu Tyr Val Pro 

&5 90 DS 

Glu Thr Ser Ser Ser Arg Lys IV ^ Thr Pro Leu Glu Gin Gin Val Vol 
100 1D5 11D 

Glu Leu Lya Ser Lys Tyr Pro Asp V*il Val Leu Wet Val Glu Va"1 Gly 
=U5 120 125 

Tyr Arg Tyr Arg Phe Phe Gly Giu Asp Ala Glu He Ala Ala Arg Val 
130 135 140 

Lfcu Gly lie Tyr Ala His tfet A Bp His ACtt Pbe Me*: Thr Ala Ser Val 
14S ISO 155 1G0 

Pro Thr Phe Arg Leu Asti Phe His Val Arg Arg Leu Val Asn Ala Gly 
165 170 175 

ryr Lys lie Gly Val Val Lys Gin rhr Glu Thr Ala Ala He lys s«r 
1*0 135 190 

His Gly Ala AST! Arg Thr Gly Pro Phe Phe Arg Gly Leu Ser Ala Leu 
195 2D0 205 

Tyr Thr Lys Ala Thr Leu Giu Ala Ala Glu Asp lie ser Gly Gly eye 
210 215 220 

Gly Gly Glp Olu Gly Phe Gly S*r Gin 5*r hst* Phe Leu Val Cys Val 
225 23Q, 2j5 24D 

Vftl A^p Glu Arg Val Lye Ser Glu Thr Leu Gly Cyu Gly lie Glu Met 
245 250 2S5 
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S^r Phe Asp V,U Arg Val Gly Val Val Gly Val Glu Lie Ser Thr Gly 
260 265 570 

01 Li Vdtl VaI Tyr Glu Glu Phe Asn Asp Asn Phe Wet Arg Ser Gly Leu 
275 230 285 

Glu Ala Val lie Leu S©jf Leu Set" Pro Ala Glu Lfcu Levi Leu Gly Gin 
29f> 29S 3-00 

Pro Leu Ser Gin Gl A Thr Glu Ly& Ph* Leu Val Ala Mat Ala Gly Pro 
305 319 315 330 

Thr Sex Aan Vsl Arg Val Glu Arg Ala Ser Leu Aep Cya Phe Ser Asn 
32S 320 335 

Gly Asu Ala Val Asp Glu Val lie Ser Leu Cys Clu Lys 11* Ser Ala 
34Q 245 350 

Gly Asn l>*u Blu Asp Asp Lya Glu Met Ly9 Leu Qlu Ala Ala Clu Lye 
355 3^0 3*5 

Ciy Mftt iier cy& Leu Thr tfal Kis Thr He Met Aan Mot Pro His Leu 
3T0 375 380 

Thr Val Gin Ala r„eu Ala Leu T2:r Phe Cya- Hi a Leu Lya Gin Phe Gly 
3S5 39D 300 

Phe Glu Arg lie Leu Tyr Gin Gly Ala &er Phe Arg Ser Leu Ser Sex 
4Q5 410 415 

Asn Thr Glu Met Thr Leu £er Ala A&n Thr Leu «ln Gin Lfcu Glu Val 
420 425 430 

Vnl Lys Asn Aan Ser Asp Gly Ser Glu 6er Gly Ser Leu Phe Hie Aan 
435 440 445 

>1et Asn His Thr Leu Thr Val Tyr Gly Ser Arg Leu Leu Arg KJ.e Trp 
454 455 460 

Val Thr Hie Pro Leu Cys Asp Arg ABU Leu lie Ser Ala Arg LAU Asp 
465 470 475 4fi0 

Ala Val Sex Glu lie Ser Ala Cys Mat Gly Ser Ki3 Ser fior Ser Gin 
435 490 495 

Leu Ser B&r Glu Leu Val Glu Glu Gly Ser Glu Arg Ala lie Val Ser 
500 505 510 

Pro Glu Phe Tyr Leu vaJL Leu Ser Ser val leu Thr Ala Met Ser Arg 
515 520 525 

Ser Ser Aep Tie Gin Arg Gly Tie Fhr Arg lie Pha His Arg Thr Ala 
53D 535 540 
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Lys Ala Thxr Glu Phe lie Ala Val Met Glu Ala lie Leu 1/su Ala GLy 

S50 555 560 

Lys Gin ILe Gin Arg Leu Gly He Lya Gin Asp Ser GLu Wee Arg Ser 
565 570 575 

Met Gin £er Ala Tbr Val Arg Ser Thv Leu LftU Arg Lys Leu lie Ser 
5S0 585 590 

Val He sex ser pro val Val v a i Asp Aan Ala Gly Lys Ldu Leu Bar 
£95 &00 €05 

Ala Leu Asn Lys Glu Alas Aid Val Arg Gly Asp Leu Leu A&p Ila Leu 
610 £15 620 

He Thr Ser Ser Asp din Phe Pro Glu Leu Al* Glu Ala Arg Gin Ala 
625 630 

Val Leu '^Al He Arg Giu Lys L'ju Asp Ser Ser He Ala Ser Phe Arg 
6*5 £50 655 

l.ys Lys Leu AJa I]* Arg Asn Clu Phe Leu Cln Val Ser Gly He 

660 66b 670 

Thx 31 is Leu rle Glu Leu Pro val A^p ser Lys Val Pro His Ago Trp 

e&o 6a5 

Val Lys Val Asn Ser Thr Lys Lye Thr lie Arg Tyr Hie Pro Pro Glu 
6 90 700 

He Val Ala Gly Leu A£p Glu Leu Ala Leu Ala Thr Glu Hi b Leu Ala 
™5 710 7lS ISO 

He Val A&n Arg Ales Ser Trp Asp Ser Phc Lou Lys Ser Pha Sb£* Arg 
725 7i0 755 

Tyr Tyr Thr Asp Phe Lys* Ala Aia Val Gin Ala Leu Ala Ala Leu AGp 
740 745 750 

Cys LfcU His Sbt Leu Ser Thr Leu 5er Arg Asn Lys ASA Tyr Val Arg 

760 765 

Pro Glu Phe Val Asp Asp Cyff Glu Pro Val Glu lie Asn He Gin Ser 
770 775 790 

Gly Arg His Pro Val L#u Glu Thr lie Leu Gin Aap Aan Phe Val Pro 
765 790 795 800 

Asn h&p Thr He Lbu Hia Ala Glu Gly GLu Tyr Cye Gin He He Thr 
«05 .' aio S15 

Gly Pro Asn Met Gly Gly Lys Ser Cya Tyr Ila Arg Gin VrI Ala Leu 
82D 825 $ 30 - 
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lie Ser lie M*L Ala Gin- Val Gly Ser Pbe Val Pro Ala ^er Phe Ala 

640 945 

Lys l^eu Hi a val lieu Asp Gly Val Phe Thr Arg Hen Gly Ala Ser top 
650 S55 GGti 

Ser He Gin His Gly Axg ser Thr Phe Leu Glu Glu Leu Ser Glu Ala 
8£5 S^O 675 S&O 

Ser Hia lie He Arg Thr CyS S&r Ser Arg 5er Leu Val lie Leu Asp 
&GS 990 935 

Glu L*u Gly Arg Gly Tbr Ser Thr Hid Asp Gly Val Ala Zle Ala Tyr 
300 503 910 

Ala Thr LgU GlP Hia LCU Leu Ala Glu Ly& Arg Cys Leu Vsl Leu Phe 
915 920 325 

Val Thr Hie Tyv Pro Glu He Ala Glu He Ser Aen Gly Phe Pro Gly 
?30 935 940 

Ser Val G>y Thr 'tyr His Val Ser Tyr bey Thr Leu Gin Lya Asp Lyc 
945 9*0 95* 560 

Gly Ser Tyr Asp Bis Asp Asp Vrtl Thr Tyr Leu Tyr LyB Leu Vai Arg 
965 97D 975 

Gly Leu Cy£ Ser Arg Ser Phe Gly Phe Lys Val Ala Gin Leu Ala Gin 
980 9B5 390 

lie Pro Pro Ser eye lie Ar-£ Arg Ala He Ser Wet Ala Ala Lye Leu 
*95 1D0Q lO^S 

Glu Ala Glu Val Arg Ala Arg Glu Arg Asn rhr Arg MeL Gly Glu Pro 

101& 102Q 

Glu Gly His Glu Glu Pro Arg Gly Ala GlU Glu Ser He Ser Ala Leu 
1025 IQiO 1035 1040 

Gly Asp Le-U Phtt Ala Asp Leu Lye Phe Ala Leu Ser Glu Glu Asp Pro 
1045 1050 1055 

Trp Lye Ala Phft alu Phe Leu Lys His Aid 7rp Lys He Ala Gly Ly£ 

1050 1065 1070 

t 

lie Arg Leu Lya Pro Thr Cys Sex Pha 
1075 10BO 



-c2io> 20 ; 

<2H> 24 

<2i2> DNA 

<213> Artificial sequence 
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1B 

<220p 



<2'2l> 


specific primer 63 6 for PCR. vising cDtfA of AraMdopsie 
thaliarxa adOtype Columbia 




20 








24 


<2lO> 
<212> 


21 
29 
DNA 

Artificial sequence 




<22t>> 
<222> 


Printer S81 for PCR using cDNA 
Columbia 


of Arat>i<3apsis t liana ecotype 




21 




cgtcgccLt t 


agcatcccct tcctrcac 




<21D> 
<2ll> 
<212* 
c2L3> 


22 
30 
DMA 

Art j. f i c i a 1 ee-giiB nc b 




<220> 
<2i3* 


MSHCJ specific primer S323 tor 
thai ia da ecotype Columbia 


PCS vsirig cDNA of Arabidop&is 


<*0O> 


22 






tetaatagaa tcatgacsgg 




<210> 
<r2ll> 
<2L2> 
<2l*> 


23 
DNA 

Artificial sequence 




<22ti> 
<22l> 


MSKS specific printer 637 for 
ttialiana ecofcype Coluircbia 


PCR xjsing cDNA of Arabidopfiis 




23 






24 



<210> 
<2l2> 



24 
DMA 
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2 1 3 > Artificial sequence 



atCcCgggat gcagcgccag agatcgantt tgt 



<2M» 2? 
<2I2> DNA 

<2l^> Artificial sequence 



33 



*222> MSH6 specific priiw.r for PCR using cDNA of Arabidopeis 

t ha Liana ecotype Columbia 

cgc^-it. ctat ygctgcttcg uut Cga^ 27 
<212s> X>UA 

<2l3> Arfibidopeis thaliana ecotype. Columbia 

-=223> Clone 43 

<40G^ 2* 

cccgggatgc agcgccagag arcgattttg tctttcttcc aaaaacccac ggcggcgact 6 0 

acgaag^gtt tggtttccgg c^atgctgct agcggegggg gcggcagcgg aggaceacga 12 0 

LCHA&tgtga aggaagggga tgctaaaggc gacgctcctg tacgttttgc tgt ttcgaaa Lao 

tctgrcgatg aggttagagg aacggatact ccaccggaga aggttccgcg tcgtgtectg 24 0 

ccgcefcg§at ttaagccggc tgaatccgcc ggtgatgctt cgtccctgtt ctccaatatt 3 00 

atgcatiagt ttgtaaaagt cgatgatcga gattgttctg gagagaggag cegagaagac 3 60 

gttgttccgo tgaatgattc atctctatgt atgaaggcca atgatgttat CdC&caafctt 420 

cgtcecaata atggtaaaac tcaagaaaga aaccatgctt ttagCttcag tgggagagfct 4 80 

gaacctag&t Cagtagaaga tkbaggagta gatggcgatg ttcctggtcc aga&acacca 540 

gggatgcgtc cacgtgcttc tcgcttgaag egagttctgg aggacgaa&t, gacttctaag £0 0 

gaggataagg tccccgtatt ggactctaac aasaggctga aafctgctcca ggatccggct £6 0 
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cgr.g^agaga 


<*gaaag3agt 


aaacgascjga 


accaaat ttg 


aatggcttga 


gtcttctcga 


720 


atcagggatg 


ccaatagaag 


eicgtcctgat 


gatccrcttt 


acgatagaaa 


g^ccttacae 


"730 


ataecacctg 


at^tcttcaa gaaaatgtct 


gcatcaca-a* 


a<3caa&attg 


gagtgttasg 


940 


acjtgaaeata 


tggacatt^t: 




aaagtgggga 


aattttatga 


gct9tatgag 


900 


ctagatgcgg 


aattatggtca 


caaggagctt 


gactggaaga 


tgaccatgag 


tggtgtggga 


950 


aeatgcagac 


aggttggtat 


ctctgaaagt 


gggatagatg 


aggcagtgea 


aaagctatta 




gcccgtggat 


atasagttgg 


icgaatCqag 


csgctagaaa 


catctgacca 


ageaaaagee 


1080 


agaggtgcta 


at aetata at 


z ccaaggaag 


CLagttcagg 


tatcaact.ee 


atcaacagea 


114Q 


agegagggaa 


acateggger 


tqatgccgtc 


catcctcttg 


ctaaa&aaga 


gatcaaaatg 


J.2D0 


gagcLacaaa 


agtgttcaac 


tgtgtatgga 


tttgcttttg 


Ltgactgtgc 


tgecttgagg 


126Q 


Lt ttgggt'g 


sgtccutcag 




icatgtgccg 


ccctcggagc 


gttattgacg 


1320 


caggcctctc 


caaaggaagt 


gttntac^ac 


agcaaagggc 


taLcaagaga 


agcacaaaag 


13SO 


gccetaagga 


aataLacgco 


<gac33gg7-.ee 


acggcggcac 


agttggctcc 


scjtaccacaa 


1440 


gtaatggggg 


atacagatgD 


tgctggagtt 


agaaatataa 


tagaatctaa 


c$3atacttfc 


i&oo 


aaaggecct v 


ctgaaccatg 


g&actgcgct 


gttgatggtc 


taaacgaatg 


tgetgttgcc 


1550 


Ctx-ag^gctc 


ttg^agagct 


aattaatcat 


ctgtctaggc 


taaagctaga 


agatgtactt 


162G 


aagcatgggg 


atatctcttc 


ataccaagcc 


racaggggtt 


gectcagaat 


cgatggccag 


1680 


acgacggtaa 


atcttgagat 


atttaacaac. 


agctgtgatg 


gtggtccttc 


agggaccttg 


1740 


tacaaatatc 


ttgataactg 


tgttagtcca 


actggtaagc 


gactctcaag 


gaattggatc 


1BD0 


tgccatccac 


tcaaagatgt 


agaaagcatc 


aatsaaegge 


ttgatgtagt 


tgaagaattc 


1B6Q 


acggcaaact 


cagaaagtat 


gcaAAtcact 


ggecagtate 


tccacaaact 


tccagactta 


1320 


gaaagafitgc 


teggaegcat 


caacptctagc gttcgateat 


cagcctGtgt 


gttCfcetgct 


I960 


cttctgggga 


aaaaagtgct; 


gaaaeaac^a 


gteaaageat 


ttgggeaaat 


tgcgaaaggg 


20*0 


ttcagaagtg 


gaattgatee, 


gttgeeggct 


ctacagaagg 


aaeeaaacat 


gatgagtttg 


2J-00 


ctctataaac 


tctgtaaact 


tcctacatxa 


gtaggaaaaa gegggctaga gttatccctc 


2160 


tctcaattcg 


aagcagecat 


agatagcg 
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*211> L365 

<212* DMA 

<-2l3^ ArabidopBiB t ha liana ecotype Columbia 

«:223> Clona 62 

«400> 27 



cat cacs-cc t c 


1" nt n t t rsc^r'-t 


nc f~ h f r* I - c*>rr 

y ■— l_ '— L. l_ <_ W LJ 


y y triage 


gcngaaacaa 


eg agrLasag 






ciUl L. L^L^aan 


y s 3 »- ^ y 


gt.ggaax.i,ga 


t-Ct y L, tg t lh 


« h* Mm ^ #M p» AJ n. 

gccccaca^a 


±20 




u at gacgagL 


P 4- u. 4- _ a. _ 

LLgLLLLELLa 


aaccctgtaa 


a ct-t.ee cat a 


tcagcaggaa 


180 




•» rV %» h* ■» •» ►» h 

•Ligag l ca t c c- 




t cgaagcagc 


catag5.tagc 


gactctccaa 


240 


I> l~ *" 3 f" p 3 n a a 


-ccaagacgc-y 


a*, aga c-y a<aa 




tctcacsata 


cttatc-^aac 




LlLtLu t-t. is. 


aagagcaact 


caatggtctg 


aggtcat^C^ 


c aetata age 


cgcctaga^3 


360 


fl t- 1> 


ctccgcaar c 


gcagca&gCc 


t cc ct get gg 


aa^catzggee 


aggectgr r.^ 


42 D 


L I_ l_ I. •_ h. L L a 


at cagaagct 


acagatcaga 


sr.cag^^^^c 


iiaaagggcua. 


a tact t: * a* a 


460 


**■ K- w ^ «y y L- 


acggcavcca 


ttt.gcagccg 


c agecgat gg 


t ca act gc c t 


g tt ccgaatg 


54 0- 


1 W*-A 14 C\ la. 1_ 1— 


tggcgaggct 


agaagaagca 








C f\f\ 




aaacacgggc 


ggasaatcaa 








(Jf A 
D QU 


tec ttgccca 


acttggctgc 


tacgtgccgt 


rrt" it a rrt" or r n 


r*"rt *1 A !*• h tr» 




T ■? A 


ctaccttcac 


aaggcctggc gcatctgatn 


^1 c& ^ ^ q ^ kj a v _ 


a vty 1=3 y ^^j^y »■ 




r *> V 


tagaatgcac 


tgagacagcg 


tcagttcttc 


A A A t d c n a c 


V- L.^Xl^ y u ^_ l_ \_ D 






ttg&cgaact 


gggtragagga 


actagtactt 






l a g t. y L L L 


j u u 


ttcgtcacct 


ggtagagaaa 


gttcaatgtc 


ggatgcTctt 


cgcaacacat 


taccacccrt: 


960 


teaccaagga 


Attcgcgtct 


cacecacgtg 


tcacctcjaa 




tgegcattea 


1020 


aatcaagatc 


tg&fctatcaa 


ceacgtggtt 


qcgatcaaqa 


cctagtgttc 


ttgtaccgtt 


1090 


taacxgaggg 


agcttgtcct gagigctacg 


gadttcaagt 


ggcactcatg 


ggtggaatac 


1140 


caaaccaagt 


ggttgaaaea 


gcatc&ggtcf 


ctgctcaagc 


catgaagag^ 


tcaattgggg 


1200 


aaaacttcaa 


gteaiigtgag 


ct^agatdtg 


agttctcaag 


tetgeatgaa 


gactggctca 


1260 


agtcattggt 


gggtatttct 


cgagtcgccc? 


aCAAtJaaHgc 


ccccattggc 


gaagatgact 


1320 


acgac&cttfc 


gttxtgetta 


tggcatgaga 


tcaaatCCtC 


ttactgtgtt 


cccaaataac 


1380 
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ccggg 



<210> 2S 

c211> 34 

<212> DtfA 

cil^i- Arriiicial sequence 
<220> 

*223> WSK6 specific primer 2as for pcr using cDNA of Arnbidop&i*; , 
ttialxana ^cotype CaluTnbis. 

<400> 2€ 



atcrogggtt atttgggaac scragtaagag gatt 34 



<2*0> 2$ 

<21L> 27 

-.212> DNA 

^2l3> Artificial sequfince 
<22C> 

c223> NSH6 specific primei* £82 fcr PCR using cDNA of Arabidopsis 
thali^aa ecc^typ^ cc>lwnfc>ia 

<$00> 59 



gcgttcgatc atcagce&Ct gtgttg:: 27 



<21Q> 30 

<211> 3606 

e212> DWA 

<213> Arabitfopsls thallana ecotype Columbia 
<22fl> 

<221> CDS 

<222> .... (3463) 

<223> AtM£H€ full-length cDNA and deduced sequence OT the encoded 
polypeptide 



aaaagttgag ccctgagg^g tatigtttcc gccatttcta cgacgcaagg cgaaaatttt &0 
tggcgccaat ctttccccce tttcgaatcc ectcagetea aaacatcgtt tctctcccac 120 



tctctctcac aattccaaaa a atg cag 05c cag sga teg aLt ttg tct ttc 

Wet Gin Arg Gin Arg Ser He Leu £er Phe 
15 10 



171 
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ttc caa aaa ccc acc gcg gcg act acg aag ggt ttg gC-t tec ggC gat 213 
Phe Gin Lye fro Thr Ala Ala Thr Thx Lye Gly Leu Val Sex Gly Asp 

15 * 20 25 

gut get age go.c ggg ggc ggc age gga gga cca cga cct aat gtg sag 2S7 
Ala Ala Ser Gly Gly Gly Gly Ser Gly Giy Pro Arg Phe Asn Val Arg 

30 40 

gaa ggg gat get aaa ggd gac get tct gta cgt ttt get gtt teg aaa 315 
Glu Gly Aap Ala Lye Gly Asp Ala Ser Val Arg Phft ALa Val Ser Lya 
45 50 55 

tct 5tc gat gag gtt aga gcja acg gat act cca ccg gag Aag gtt ccg 362 
Ser VaI Asp Glu Val Arg Gly Thr Asp Thr Pro Pro Glu Lya Val Pro 
bfi 65 70 

cgt cgt gtc cCg ccg tct gga ttt aag ccg get gaa tec gec gst g*t 411 
Arg Arg Val hou Pro Ser Gly Fhe Lys Pro Ala Glu Ser Ala Gly Aap 
75 50 eS 3D 

get teg tec ctg tec tec aac at t atg cat aag ttt gta aaa gtc gat 459 
Ala 5fer Ser Leu Phc S*r A_^_ llr. war His Ly& Phe Val Lye V> r al Asp 

10O 105 

gac cga gat tgt ICZ g^a gag agg age cga gaa gftC gtt gtt ccg ctg 50"? 
Asp Ar^ Aep cys Ser Gly Glu Arg Bar Arg Glu Asp Val Val Pro Leu 
1IO 115 120 

aat gat tea tdt cca tgt atg aag get aat gat gcr. att cct caa ttc 
Asn Aep Ser Ser Leu Cys Met Lys Ala Aan Asp Val lie Pro Gin Phe 
125 130 135 

cgt tec aat aat ggt aaa act caa gaa aga aac cat get ttt agt ttc 603 
Arg Ser Asa Aan Gly Lys Thr Gin Glu Arc; Abji His Ala Ph£ Ser Phe 
140 145 150 

*3ft 933 9 ct 9 aa ctt a 9 a tca 9t* ?aa gat ata gga gta gat g*JC 6 51 

Ser Gly Atg Ala Glu Leu Arg Ser Val Glu Asp life Gly Val Aap Gly 
155 160- 165 170 

gat gtt cct ggt cca gaa aca cca ggg at*j cgt cca cgt get tct cgc 6 99 

Asp Val Pro Gly Pro Glu Thr Pro Gly Mat Arg Pro Arg Ala Ser Arg 

175 ISO 185 

r 

ttg aag cga gtt ctg gag gat gaa atg act ttt aag gag gat aag gtt 7*7 

Lqtj Lys Arg val Leu Glu Asp Glu Wet Ttir Phe Lye Glu Asp Lye Val 
190 195 200 

cct gta ttg ^aci tct aa£ aaa agg ctg &*a atg etc cag gat ccg gtt 7 95 

Pro Val Leu Asp Ser Asrt Lys Arg Leu Lya Wet Leu Girl A&p Pro Val 
205 21D 21$ 
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tgt gga gag *aa gaa gta aac gaa g$a acc aaa ttt gaa tgg ctt 943 

Cvb Gly Giu Lys Lye Glu Val Asii Glu Gly Thr Lye Phe Glu Trp Leu 
220 22 5. * 230 

gag ten t- 1 cga ace agg gat gec aat aga aga cgt cct gat gat ccc 891 
Glu Ser Ser Arg lie Arg Asp Ala Asn Arg Arg Arg Pro Asp Asp Pro 
235 240 245 2S0 

ctt tac gat aga aag acc tta cac ata CC3 cct gat gtt ttc aa^ aaa &3 9 

Leu Tyr Asp Arg Lya Thr Lau His lie Pro Pro Asp Val Pho Lye lye 
255 260 265 

atg tct gca tea caa aag cas tat tgg age gtt aag agt gaa sat atg 967 
Hftt Ser Ma £er Gin Lya 5ln Tyr Trp Ser Val Lys Ser Glu Tyr Mat 

270 27(> 230 

gac att g!:g ctt ttc ttt aaa gr.g §gg aaa ttt tat g&g ctg tat gag 103 & 

KBp He Val Ley Ptie Phe LyS Val Gly Lys Phe Tyr Glu Leu Tyr Glu 

2 SO 295 

eta gat gcg gaa tta ggt cac stay g^g CT£. gac tgg stag atg acc atg 109 3 

Leu Asp Ala Glu Leu sly Kis Lys tilu Leu Asp Tvp Lya Mat Thr Net 
3 D D .IDS 313 

agt ggt g'jg gga aaa tgc aga sag yti; ggx 2tc tct gaa agt ggg ata 1131 
Sex Gly Val Gly Lys Cys Arg Gin Val Gly He 5er Glu Ser Gly He 
315 320 325 330 

gat gag gcd gcg caa aag ct^ eta gec cg^ gga tat aaa gtt gga cga 1173 
Asp Glu Al=i Val Gin T.iys Leu Leu Ala Axg Gly Tyr Ijys Val Gly Arg 
335 346 345 

ate gag ca^ eta gaa sea tct gac caa gca aaa gec aga ggt get aat 122"? 
He Qlu Gin Leu Glu Thr Ser Asp Gin Ala LyS Ala Arg Gly Alii Asn 
35D 355 369 

act ata att cca agg aag eta .gtt cag gta tta act cca tea sea gca 1275 
Thx He lie Pro Arg Lya Lau Val Gift Val Leu Thr Pro S&r Thv Ala 

270 375 

age gag gga aac ate ggg cct gst gec gtc cat ctt ctt get at a as. a 13 23 

S*r Glu Gly Aarj He Gly Pro Asp Ala Val Hia Lau Lau Ala lie Lye 
380 3&S 390 

gag ate aaa atg gag eta cka aag tgt tea act gtg tat gga ttt get 1371 
Glu He Lya Met Glu Leu Gift Lys Cys Ser Thr Val Tyr Gly Phe Ala 
395 400 405 410 



ttt gtt gac tgc get gec ttg agg ttt tgg gtt ggg tec ate age gat 
Phe Val Asp CyB Ala Ala Leu Arg phe Xrp Val Gly Ser lie Ser Aap 
415 420 425 



1419 
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gac gca tea tgt gec get ctt gga gcg tta ttg atg cag gtt tct cca 1467 
Asp AU Ser Cys ALA Ala Leu Gly Ala Leu Leu Met Gin Val Sar Pro 
43D 435 * 44D 

aag gaa gttj tts tac gac agt aaa ggg eta tea aga gaa gca caa aag 1515 
Lye Glu VaL Leu Tyr Asp S*r Ly b Gly Leu SSr Arg Glu Ala Gin Lys 
445 450 455 

get eta agg aaa tat acg ctg aca ggg tct acg gcg gta cag ttg get 1563 
Ala Leu Arg Lya Tyr ?hr Leu Thr Gly Bev thr Ala Val Gin LbU Ala 
460 465 470 

cca gta cca caa gta atg ggg gat aca gat get get gga gtt aga aat 1611 
Pro Val Pro Gin Val Met Gly Aep Thr Asp Ala Ala Gly Val Arg Asn 
475 430 495 490 

ata ata gaa tct aac gga eae ttt aaa ggt tct tct gaa tea egg aac l£S9 
11* lift Glu £er Asn Gly 'Tyr Phc Lys Gly Ser Ser Glu Ser Trp Asn 

a95 500 505 

tgt gec gtt gat 39 c eta ant £aa cqx: gat get gec etc agt get ctt 1707 
Cys Ala Val Asp Gly L&u Asr. Glu Cys Aflp Val Ala Leu Ser Ala Lb-j 
510 SIS 520 

gga gag eta att aal cat ctg tct agg Cta aag eta gaa gat gta ctt l?BS 
Gly Glu Leu He A&n Kis Leu Ser Arg Leu Lys Leu &lu Asp Val Leu 
52& 535 

Stag cat ggg gat att ttt cca tfcC caa gtt tac agg ggt tgt etc aga 1&03 
Lys His Gly Aep lie Phe Pro Tyr Glii V&'± 7yv Arg Gly Cys Leu Arg 
54 0 54 5 0 

att gat ggc cag acg at^ gta aat ccc gag ata ttt a^c aat age egt lb5i 
lie Asp Gly Gl« Thr Mat Val Asn Leu Glu lie Pfce Abji Asn Ser Cy& 
555 SfiO S65 57Q 

gat ggt ggc ccfc tea ggcf acc ttg tac aaa tat ctt gat aac tgt gtt IB 9 9 
Asp Gly Gly Pro Sar Gly Thr Lbu Tyr Lye Tyr Leu Asp Asn Cys Val 
$15 5S0 535 

age cca acc ggt aag cga etc tta agg aat tgg ate tgc cat cca etc 1947 
6er Pre- Thr Gly Lys Arg Lati Leu Arg Asn Trp He Cys Hie Pro Leu 
590 595 600 

aaa gat gta gaa age ate aat aaa egg ctt gat gta gtt gaa gaa ttc 192S 
Lya A£p Val Glu Ser II* A»n Lys Arg Leu Aap Val Val Glu Glu Phft 
£05 6lt> 615 

acg gca aac tea gaa agf atg caa ate act ggc cag tat etc cac aaa 204 3 
thr AU Asn Ser Glu 6 r Net He Thr Sly Gin Tyr Leu His Lys 
*20 625 S30 
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cLt cca gac tta gaa aga ctg etc g^a cgc ate sag tct age gtt cg& 209 1 

Leu Pro Aep Leu glu *rg Leu Leu Gly Arg tie Lye Ser Ser Val Ar§ 

tea tea gee tct gtg ttg cct get ctt ctg ggg aaa aaa gtg ctg aaa 2i39 
Ser ser Ala Ser Val Lsu pro Ala Leu Leu Gly Lys Lys vat Leu Lys 
655 £60 665 

caa cga gtt aaa gca ttt ggg caa att aaa ggg ttc aga agt gga 2137 
Glx* Arg Val Ly* Ala Phe Gly Gin lie Val Lys Sly Fhe Arg Ser Gly ; 
670 67S 650 

att gat ctg ttc; ttg get eta cag aag gaa tea aat ajzg atg agt ttg 2255 
lie Asp L€u L*u Lau Ala Leu cin Lys Glu eer Asu Met Met Ser L*u 
6GS 690 

ctt tat aaft etc bgt aaa ctt cct ata tta gta gga aaa age ggg ct& 220 3 
Leu Tyx Lys Leu Cys Lys Leu Pro lie Leu Val Gly Lys 3er Gly Leu 
"700 70S 7in 

gag tta ttt ctt tct caa ttc $aa gca gee ata gac age gac ttt cca 2531 
Glu Leu ?hft L^u Sftr Gin fc?hB GLu Ala Ala lie Asp Ser Asp Pro 
715 ?2D 725 730 

aat tat cag sac csa gat gtg aca gat gaa aac get gaa act etc aca 2379 
Aen Tyr Gin Asi> Gin Asp Val Thr A»p Glu Aan Ala Glu Thr Leu Thr 
735 740 745 

ata ctt ate gaa ctt ttt ace gaa aga gca act caa tgg tct gag gtc 24 27 

He Leu lie Qlu Lev Phe ri* GLu Arg Ala Thr Gin Trp Ser Glu VaL 
?S0 755 "7$[> 

att cac acc ata age tgc eta gat gtc ctg aga tct ttt gca ate gca 2475 
lie Hi 5 Thr lie Ser Cys Leu Asp Val Leu Arg Ser Phe Ala lie Ala 
765 770 775 

gca agt etc tct get gga age atg gec agg cct gtt att tct ccc gea 
Ala Set Leu Ser; Ala Gly 6er Mat Ala Arg Pro Val He Phe Pro Glu 
760 765 790 

tea gaa get aca gat cag aat cag aaa aca aaa ggg cca ata ctt aaa 251 1 
Sar Glu Ala Thr Aep Gin ASIA Gin Ly& Thr Lya Gly Pro He Leu Lys 

7 ?s soo eos 

ate caa gga eta tgg cat cca ttt gca gtt gca gec gat ggc caa ttg 2$i& 

lie Gin Gly Leu Trp His PxD Pha Ala Val Ala Ala Asp GLy Gin Leu 

615 920 B25 

cct gtt ccg aat gat ata etc ctt ggc gag get aga aga age agt ggc 2567 
Pro Val Pro Asa Asp lie l/eu Leu Gly Glu Ala Arg Arg Ser Ser Gly 

630 635 &<LD 



WO 99/1 949* 



57 



PCTjteP9SWW977 



age cat cct Cgg tea Ctg tea ctg acg gga cca aac atg ggc gga 2715 

Ser lie F*is Pro Arg Ser Leu L&u Leu Thr Gly Fro Asn Met Gly Gly 

aaa tea act etc etc cgr. qca aca tcjt ctg gec gtt ate ttt gee caa 27fi^ 
Ly? Sey Thr Leu Leu Arg Ala Thr Cys L*U A lA. Val lie Phe Ala Gin 
960 S65 &■?{■ 

Ctt ggc tgc tac gtg ccg tgt gag tct tgc gaa ate ECC etc gtg gat 2S11 
Lea Gly Cys Tyr Val Pro Cye Glu Ser Cys Glu He Ser Lao V&l Asp 
8"75 390 B85 330 

act ate ttc aca agg Ctt ggc gca tec gat aga Ate atg aca gga gag 2B5 9 

Thr lie Phe Tbr Arg Leu Gly Ala Ser Asp Arg lie. Mat Tftr Gly Glu 

POD 905 

age acc ttt ttg gta gaa tgc act gag aca gcg tea gtt Ctt cag &at 2907 
Ser Thr Phe LcU Val Glu Cys Thr c>lu Thr Ala Ser Val Leu Gin Asn 
91fl 915 930 

gca act cag gat Zt& c-t*. £ta ar.c ^tc gac gaa ccg ggc aga gga act 2955 
i\la Thr Gin Asp Ser Leu Vaj I : v Leu Asp Glu i.e\J Gly ftrg Oly Thr 

^30 935 

agt act ttc gat gga car gec ace gca tac teg get ttt cgt C&C ccg 30G3 
Ser Thr Phe Asp Gly Tyr Ala lie Ala Tyr &er Val Phe Arg His Leu 
940 S45 950 

gta gag aaa gtt caa tgt egg atg etc ttt ges aca cat tac cac cct 3 0S1 

Val Glu Lya Val Gin CyS Arg Met Leu Phe Ala Thr His Tyr Hi b i?ro 
955 960 965 970 

etc acc aag gaa btc gcg tct cac cca cgt gtr acc teg aaa cac atg 309 3- 

Hsu Thr Ijye Glu Phe Ala Ser His Pro Arg V&l Thr Ser tjye Hie Met 
975 S60 9S5 

get tgc gca ttc aaa tea aga tct gat tat caa cca cgt ggt tgt gat 3147 
Ala Cys Ala Fha Lys Ser Arg Sec Asp Tyr Gin Pro Arg Gly Cyfi Aap 

995 1000 

caa gac eta gtg ttc ttg tac cgt tea act gag gga get tgt cct gag 3195 
Gin Asp Leu Val Phe Lea Tyr Arg Leu Thr Glu Gly Ala Cys Pre Clu 
1005 1010 1015 

age tac gga ctt caa gtg gba etc atg got gga ata cca aac caa gtg 3243 
Ser Tyr Gly i^eu Gin Val Ala Leu Met Ala Gly lie Pro Afitl Oln Val 
1020 1025 103Q 

gtt gaa aca gca tea ggt get get caa gec atg aag aga tea att ggg 32 92. 
Val C51u Thr Ala Ser Gly Ala Ala Gin Ala >5Gt Lya Arg Ser lie Gly 
1035 1040 1045 1050 



WO 99.1 W92 FCIVEP9 

2* ■ - 

gga aac ttc aa^gr Cca agt ga£ eta ag3 tct gag ttc tea agt ctg cat 13 3 9 

Glu As>n Pbe Ly£ Ser Ser G1\J Leu Arg Ser Glu Phe Ser Ser Leu His 
1DS5 LOGO 1065 

gaa gac tgg CCC aacj cca tfcg gtg g^t att tct cga gtc gec cac aac 3337 
Glu Asp 'l'rp Leu Lye Ser Leu Val Gly lift 3er Arg Val Ala His Affn 
107Q 1060 

aat gec CCC. att ggc gaa gat gac tac gac act ttg ttt tgc bta tgg 3435 
Aen Ala Pro He Gly Glu Asp Asp Tyr Asp Thr Leu Fhe cya Leu trp 

L090 1095 

cat gag ate aad tec Let tac tgt gtt ccc aaa taaatggcta 5478 
Kls Glu He Lys £er Ser Tyr Cys Val Pro Lys 
1100 1105 

tgacataaca ctatctgaag ctrgttaagt cttttgcctc tctgstgttt atteetccta 3 53 6 

aaaaatgett Btatatcaaa aciAct^*. *•» i ^ c^cga^aaa aaaaaaaaaa aaaaaaaaaa 25*6 

aaaaaaaa $60$ 



<210> Jl 

<2ll> 1109 

*2}2* PRT 

<2i3> Arai>idopsri$ fc/iet liana eco^ype ColumMa 

<223> Polypeptide MSH6 

^40Q> 31 

Met Gin Ar^ Gin Arg Ser He Leu Ser Phe Phe Gin Lya Pro Thr Ala 
1 S 10 15 

Ala Thr Thr Lys Gly Leu Val Ser Gly Aed Ala Ala Ser Gly Gly Gly 
20 25 3D 

Gly Ser Gly Gly Pro Arg Pbe asp val Arg Glu Gly Asp Ala Lys Gly 
35 40 45 

Asp Ala Ser Val Arg Phe Ala val Ser Lys Ser val Asp Glu val Arg 
50 55 60 

Gly Thr Asp Thr Pro Pro Glu LyS Veil Pro Arg Arg Val Leu Pro 9*r 
65 70 * 75 90 

Gly Phc Lysr Pro Ala Glu Sex Ala Gly Asp Ala Ser Ser Leu Phe Ser 

*S 90 95 

v 

Aan He Met His Lya Phe Val Lya Val Asp Asp Arg Asp Cys Ser Gly 
160 110 



Glu Arg Ser Arg Glu Asp Val Val Pro Leu Aan Aap Ser Ser Leu tya 
115 120 125 
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Met. U'/s Ala Asn Asp VaL He Pro Gill Phe Arq Ser Asn ABtt Gly Lys 
i30 135 * 140 

Tbr Oln Glu Arg Ag» His- Ala Pbe Ser Phe Ser Gly Arg Ala Glu Leu 
2.45 150 155 160 

Arg St=t* Val Glu Asp lie Gly Val Asp Gly Asp val Pro Gly Pro Glu 
IBS 17D 175 

Tbr Pro Gly Met Arg Pro Arg Ala s«r Arg Leu Lys Arg Val Leu 01 u 
160 Lfi5 1$Q 

Aep Glu Met Thr Phe Lys Cilu Asp Lys VaL Prfc Val L*\l Aflp Ser Asn 
1$S 2D0 205 

kys Arg Leu Lys. Met Leu Gin Asp Pro Val Cys Gly Glu Lye Lys Gin 
210 215 220 

Val Asn Glu Gly Thr Lys Phe Giu Trp Leu Clu 5c r Sear Jsrg lie Arg 
225- 230 23S 240 

Asp Ala ASH Arg Arg Arg TrC- Asp Aop Pro Leu Tyr Asp Arg LyU Thr 
245 250 255 

Lou Bis He Pro Pre- Asp Vb.1 Phe Ly* Lys Met Ser Ala Scr Gin Lye 
260 265 270 

Gin Tyr Trp Ser VaI Lye Ser Glu Tyr Met Asp lie V£tl Leu Fhe fhe 
27$ ?BQ 2fl5 

hy& Val Gly Lys Phe Tyr Glu Leu Tyr Glu Leu Asp Ala Glu Leu Gly 
29D 295 3o0 

HiB Lys Glu Leu Asp Trp Lys Met Thr Met Scr Gly Va.1 Gly Lya Cys 
30£ 310 215 320 

Arg Gin Va\l Gly lie Ser Glu Ser Gly lie Aap Glu Ala Val Gin Lya 
32$ 330 *35 

Leu Leu Ala Arg Gly Tyr Lys Val Gly Arg Tie Gly Gin Leu Glu Thr 
340 345 350 

Ser Asp Cln Ala Ly& Ala Arg Gly Aid Aafl Thr lie He Pr<? Arg Lya 

Leu Val Gin Val L*u Thr Pro Ser Thr Ala $er Glu Gly Aim He Gly 
370 375 3B0 

Pro Aep Ala val Hi* Leu Leu Ala lie Lys Glu lift Lya Met Glu Leu 
3B5 390 395 400 



Gin Lya Cys Ser Thr Val Tyr Gly Ptv* Ala Phe Val Asp Cys Ala Ala 
405 41P 415 



W0 99V1949S PCIVEJP&MIGOT? 

30 

Leu Arg Phe Trp val Gly Ser lie Sbt ab P Asp Ala Sor Cys Ala Ala 
420 42S- 420 

Leu Gly Ala Leu Leu Met Gin Val. Ser Pro L,ye Glu Val Leu Tyr Asp 
4SS 440 445 

Ser Lys Gly Leu Ser Arg Glu Ala Gin Lys Alft Leu Arg Lys Tyr Thy 

455 4 eo 

Leu Thr Gly Ser Thr Ala Val Gin Leu Ala Pro Val Pro Gin Val Met 
46S 470 415 4B0 

Gly Aap Thr ABp Ala Ala Gly Val Arg Afcn lie lie Glu Ser AST1 Gly 
4SS 4$D 495 

Tyr Phe L,ye Gly S&r 6sr Glu Ser Trp Asn Cya Ala Val Asp Gly Leu 
500 505 510 

Asn Glu Cys Aep Val Ala Leu Ser Ma Leu Gly Glu Leu lie Asn Bis 
515 520 525 

Leu Ser Arg Utu bye Leu Giu Asp Val Leu Lys His Gly Asp He Phe 
530 53S S*0 

PVO Tyr Gin Val Tyr Arg Gly C/S Leu Arg lie ASp Gly Gin Thr Met 
545 550 555 5(5D 

Val Aen Leu Glu He Phe Aan Asn Ser Cys Asp C3ly Gl.y Pro Ser Gly 
565 B7D 575 

Thr Leu Tyr Lys Tyr L-SU. Asp- Asn Cya Val Ser Pro Thr ply Lyo Arg 
5B0 5B5 59(> 

Leu Leu Arg ABtl Trp lie Gys Ki B Pro Leu Lya Ae*p val Glu Ser lie 
595 $00 €05 

Asn Lys Arg Leu Afep Val Val Glu Clu Phe Thr Ala Asn Ser Glu Sot 

615 &2 0 

wet Gin lie Thr Gly Gin Tyr- fceu Hia Lya Leu Pro Asp Leu Glu Arg 
625 630 63S 64C> 

Leu Lftu Gly Arg He Lys Ser Ser Val Arg Ser Ser Ala Ser Val Leu 
645 ' 65D 6S5 

Pro Ala Leu Leu Gly Lys Lye Val Leu Lya Gla Arg Val Lya Ala Phe 
6150 6S5 67 0 

Gly Gin lie Val Lys Gly! Phe Arg Ser Gly He Asp Leu ^U Leu Ala 
«7S 680 $ B5 



Leu Gin Lys Glu Set Aan Met 
690 §95 



Met ser Leu Leu Tyr 
700 



Lys Leu Cys Lys 
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Leu Pro lie Leu Val Gly Lys Set Gly Leu Glu Leo Leu Ser Gin 

^05 710 * 715 720 

Phe Glu Ala Ala lie Aej> Set" ASp Phe Pro Asn Tyr Gin Asn Gin Asp 

730 735 

Val Tbx Asp Glu Aan Alii Glu Thr Leu Thr He Leu iLe Glu Leu Phe 
740 750 

II* Glx* Arg £la Thr Gin Trp Ser Glu Val n e Kia Thar Tie Set Cys 
755 760 765 

Lbu Asp Vy.l Leu Arg Ser Phe Ala lie Ala Ala Ser Leu £er Ala Gly 
770 775 780 

Ser Met Ala Arg Pro val lie Phe Pro Glu Ser Glu Ala Thr Asp Gin. 

^5 790 795 800 

Abx-j Gin LyS Thr Lys Gly Pro Tie Leu Lyc Lie GIji Gly Lfeu Trp Wis 
805 810 815 

Pro Phi* Ala Val Ala Ala Asp Gly Gin L-au Pro V^l Pro Aan A&p lie 
620 625 H30 

Leu l^u Gly Glu Ala Arg Arg Ser Ser Gly Ser lie His Pro Arg Ser 

B4D 645 

Leu Leu Leu Thr Gly Pro ASn Met Oly Cly LyS Ser Thr Leu Leu Arg 

S55 ' 360 

Als Thr Cy& Lku Ala Val lie Phe Ala Gin Leu Gly Cys Tyr val Pro 
^65 B70 675 890 

Cys Glu Ser Cys oLu lie Ser Leu Val Asp Thr lie Phe Th* Arg Leu 

9BS 6*0 395 

Gly Ala Ser A£p Arg lib Met Thr Gly Glu Set Thr Phfc Lsti Val Glu 

005 £10 

Cys Thr Glu Thr Ala 5er Val Leu Gin Asn Ala Thr Gin Asp Ser Leu 
915 920 925 

Val lie Leu Asp Glu Leu Gly Arg cly Thr Ser Thr Phe Asp Gly Tyr 

335 940 

Ala lie Ala. Tyx Ser Val Phe Arg His Leu Val Glu Lya Val Gin Cys 
945 550 355 360 

Arg Met Leu Phe Ala Thr Hi& Tyr Hifl PrQ Leu Thr Lya Glu Phe Ala 
965 970 375 



Ser His Pro Arg Val T*hr Sar Lye Kis Net Ala Cys Ala Pha Lys Ser 
$60 9B5 990 
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Arg Ser Asp Tyr Gin Pro A^g C>ly Cys Afcp Gin Asp L-eu Val Phe Leu 
9&S 10OQ * LOQ$ 

Tyr Arg Lett Thr Glu Gly Ala Cys Pro Glvl 5e>: Tyr Gly Leu Gin. Val 
lOLO 101S 1020 

fcla L*u Met Ala Gly lie Pro Asn Gin Val Val Glu rbx Ala Ser Gly 
1025 *0i0 1040 

Ala Ala Gin Ala Met Lys Atg Ser He Oly Glu Asa Phe I>y& Ser Ser 

1DS0 

GlU Lfcu Arg Ssr Glu Phe Ser S*r Leu His Glu Aap Trp Leu Lya 5 or 
10£0 1065 1G*?D 

L&u Val Sly Ha Ser Arg val Ala His Asn Abu Ala Pro lie Gly Glu 

1075 1 OB D 

Asp Asp Tyr Asp Thr l-f-.u ^ha Cys* LcU Trp His Clu He Lyp S % er St;r 
1090 1095 HOD 

Tyr Cya Val Pro l.ys 
1105 



<210> 32 
<212> DMA 

>:2i3> Artificial sequence 

<220> 

<223> Forward primer for PCR amplification of ATKGEtfEA 

microaatellitc 

<400> 32 

accstgcsta qcttaaactt cttg 24 



<210> 32 
<211> 22 
<212> DMA 

<.215> Artificial sequence 

<220> 

<223> Reverse primer far PCR SWfiplifi cation of ATHGENBA 

micrcsatellite 

<40G> « 

acataaccac aaatsggggt §C ^ 



i 
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<2ll> 
2 X 3 > 


IB 
DMA 

Artificial sequence 


*:22D> 


Forward primer DMClIT-A for PCR 
t ha liana esp, Land&bfcrcr eracta 


<4O0> 


34 






<212> 


IS 
DMA 

Artificial sequence 


<220> 


Reverse primer DMCIN-B for PCR 
thaliana ssp . Lands btrcj erects 


<400> 


35 


agar cgcgag 


aacaztcc 


<21i> 
<212> 


31 

Artificial sequence 


<220> 





13 



13 



023> Forward primer DMCIfT-l lor PCR on genomic DNA of Arabidopsis 

tbaXiana &sp- Landaberg erect a "Ler" 

acgrcgtcgac tcagctatga rjattsctc^t q 31 



<210> 


37 


<211> 




^212> 


DNA ; 


<213> 


Artificial sequence 


<220> 




c23J> 


Reverse primer DMCIW 



thAliana ssp. Landsberg ere^ta "i^r" 

<4O0> 37 



gctctagatc teeegctcta agactztct 



29 
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<211> 32 

--"213> Artificial $e queues 

<220> 

<223> Forward primer dmcim*3 for PGR on genomic DNA of Arab i dope i a 

tbaliana Bap. Landeberg electa "Ler 11 

gctctagagc ttctctt&sg taagtqattg at 32 

<21Q> 39 
<211> 48 
^212:- DNA 

<=2l^> Artificial sequence 

*221> Reverse primer DMCIN-4 for ±>CK on genomic DNA or ArabidOpSis 

C^Aliar-a sBp. Landsberg erecta "Ler" 

<40D> 39 

tCCCCCQCfijC zoy&^aqa^n tccatggttT: cttcagctct atgaatcc 4B 

<2l0> 40 

<2llr> 26 

*2}2:> WTA 

<213> Artificial sequence 
<22D> 

<223> Forward primer DttCla for PGR on ganomic DNA Of Arabidopei* 
tbeliana asp. Landeberg erects "Ler 11 

acgcgtcgac gaattcgcaa gtgggg 2 6 



<210> 41 / 

<211> 3S 

«21i> DNA 

<213> Artificial sequence 
<220> '> 

c223:> Revere* primer DMClb for PCR on gejlDfflie DNA Of Arabidopfiis 
thai lane eap, Liandsbeirg ere eta "Ler 11 



<400> 



41 
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35 

rccatggaga tctcccgggt axjcgatc tgc ttcgaggg 3 6 

--21 0> 42 

<211> 20 

-212 > DNA 

'-21 3 > Artificial sequence 

*22Q> . 

<22S> F&rvartf primer for PGR Amplification of ATeati sslp marker in 
Arabidop£i.s t ha liana eubapeciee 

<4O0> 42 

gCcactgcgt gaatgatatg 20 



<2L0> 43 
<Z12> DMA 

<213> Artificial sequence 

<223> Reverse primer for PCR amplification o£ ATEAT1 SSLP marker in 

Arab i dope is thai i ana subspecies 

C^aacagcca acattaattc cc 22 

<2L0> 44 
c2lj> DMA 

<213> Artificial sequence 

t220> 

<223> Forward primer for PCR amplification of SSLP marker in 

Arabidopaia tha Liana subspecies 

44 

aaccaag^ca c&gaagcg 



<210> 45 

<211> li 

<212> DCTA 

<213> Artificial ssqusace 

<223> Reverse primer for PCR amplif icat ton of MGA63 ssl-P marker in 

Ar-abidopsia thalianas subspecies 



WO 





35 










acccaagtga 


tcgccacc 




<211> 
<2l2> 
<213> 


46 
21 

Artificial sequence 




*220> 

<223> 


Forward primer for FCR amplification of 
Arabidop&is thaliana subspecies 


NGA243 


<400> 












^211> 
<2l2> 


<n 
22 
DNA 

Artificial Bequence 




<220> 
<223> 


Reverse primer for PCR amplification of 
Arabidopsis thaliana Buh-epeciee 




<40D> 






tctgtatctc 


ggtgaattct cc 




*211> 
<212> 
<2l5> 


4& 
22 
DNA 

Artificial sequence 





1* 



21 



22 



<220> 

<22 3> Forward primer for PCR arnplif ication Df tfG;u2a SSLP markar in 

Axabidopaia ttialisna subBpecies 

<400> 4B 

ggtctgttga CCftCgtaagt eg 2 2 



*210> 49 
<211> 22 

<213> Artificial sequence 



<220> 





3? 




Reverse primer for PCR amplification 
Arabidopsis t ha liana subspecies 




49 


atcttgaaac 


ctttagggag gg 


c2l0> 
<22L> 
<2i2> 


50 
22 
DMA 

nt LlIlC lai 5 BqUtJTCe 


~. h . 
<223> 


Forward primer for PCR amplif i cat ion 
ArabicJc^peiB thai* ana subspecies 


<40D> 


SO 


ctgatctcac 




^2lD;> 
•:211> 
<212;- 


20 
DMA 

Art i f icial sequence 


J L/ > 

<223> 


Reverse primer for PCR aTnpl.if icavion 
hv a bi dope is t ha 1 ian a g Ub5 p ec i e s 


-:400> 


SI 


ggctccataa 


aaagcgc&cc 


<210> 
<212> 


52 
21 
UNA 


<223=> 


Forward primer tor PCR amplification 
ArAbidopsi& thaliana eubapecies 

f 


<400> 


*2 


ctccagttgg aagctaaagg 9 


c2XD> 
<211> 
<212> 
<2l3> 


V 

S3 

23. 

Arfcif tcial sequence 



22 



22 



20 



21 



38 



^220> 


Rev*raB primer for PCR amplif icaticrt o£ NGALLL 
Arab £ daps is thai i ana subspecies 


SSLP 


<4oto> 






tgttt tttag 






^2lD> 
<211> 
<:2l2> 


54 
20 

Art if iciil e equEn ce 




<:22 □ > 
*223> 


Forward primer for PCR amplification af NGA16B 
Arabidopeia t ha Li ana subspecies 




*400> 


54 




ccttcacatc 






<21D> 
*-.2ll>. 
<212> 
<213> 


55 
7.0 
DNA 

Art if icial sequence 




<22D;> 
<223> 


Rivers* prlttV?.r for PCR amplification od NtfAlGB 
Arabidopaia thaliana cubepecius 


SSLP 


<40D> 






gcacataccc 


acaaccagaa 




<21D> 
«211> 
<212> 
*213* 


55 
20 
DNA 

Artificial sequence 





21 



20 



20 



<220> 

<223> Forward primes for PCR amplification of ngau.26 SSLP market: 

in Arabidopaia thaliana £Ub6pecioB 

<40Q> 56 



cgctacgctt tteggtaaag ' 



20 
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< 21 D> 57 
<211> 20 

Artificial sequence 

t223> R»Vfttsa primer for PCR ampl ifieation of NGAU2S ssip marker 

in Arafcidopetis cbaliana subspecies 

<4O0> 57 
^caeagt^Ca agtcacaacc 



*21D> 








20 




<212> 


DNA 




<21S> 


Artificial sequence 




220> 






<22l> 


Forward primer PCR amplification 
Arafeutops j s thai i ana subspecies 


Of KGA3 61 


<400> 


58 




aaagagatgs 


gaatttggac 




<210> 


55 






^3 




^^12 > 


DNA 




<213> 


Artificial sequence 










<223> 


Reverse primer ior PCR amplification 
Ar^bidop^iff thaliana subspecies 


of 


<400> 








atattsaagt age 




<2l0* 






<211> 


IB ; 




<212> 


OWA 




<213> 


Artixicial sequence 




<22Q:> 






*223> 


Forward pftuner for PCR arnplif ication of NGA16S 
Arabldopeie thaliana subapecias 


<40O* 


60 





20 



23 



WO 



40 



--2LG> 61 

<211> 22 

<212> DNA 

<2l3=> Artificial sequence 
<220> 

<22l> Reverse priuver for PGR amplification ot NGA158 SSLP marker, in 
Arabidopsii t ha liana subspecieB 

<40D> 61 

gaggacatgt ataggagcct eg 22 

<21Q> 62 

<211> 20 

<2l2> DMA 

<213> Artificial s^quftn^e. 
^220^ 

<223:> Forward primer cor PCR arnp^iC icacion of AthBIOl? SSL:? marker 
in ArAbieJopsif* r.ha liana subspotfles 

<400> 62 

tgacctcccc ttccatggag 20 

<2lD> €3 

c211> 22 

<212> DNA 

c2l3> Artificial sequence 
<22Q> 

<222> Reverse primer for PCR amplification of AthBl02 SSLP marker 
in ArabidOpeife thaliana subspecies 

<r40Q* 63 

ttaacatjaaa cccaaagctt tc 22 



<210> 64 
«2ll> 21 
<212> DMA 

<2l3> Artificial sequence 

v 

<220> 

*223> Forward primer for PCR amplification of AthtJBlQUE marker 

in Arabidopsifif thftliaixa subspecies 



WO 99/19491 



41 



»•. T. t? U ^ 








aggcaaacgt 


ctJitttcatt g 




21 










<211> 










DMA 






<211> 


Artificial sequ&nce 






<220> 








<222> 


averse primer tor PCR amplir icataon 
in Arabidopsis thaliana subspecies 


of 


AthUBEQUE marker 


H ftA. 

■•i *t LP 










agatttCtCC 






<210> 








<2ll> 


21 






<212> 


DMA 








Artificial Jj^q ;tr » 






<220> 








*1 ^ .5 -» 


Forward primer for* PCR amplif icati.Ort 
Arab i dope ie th£ liana subspecies 


of 


NGA172 SSLP msrfcex in 












ttatagcgtc c 




2 1 




67 






<21l> 


13 






<212> 


DNA 








Artificial eequetlce 






<220> 








<223> 


Ke verse primal for pcr amplif ication 
Arabidopsia thai i an A 3ubepecie5 


of 


usxii? sslp marks r in 




67 






Cabccgaatg 


ceattgfctc » 




19 


<2i0> 


6 A 






<211> 


21 







<2l2> DWA v 

<213> Artificial' Sequence 



<220> 



WOW/19492 



FCT/BPWTO97! 



42 

*223> Korward primer Ct>r PCR arnpl i £ ic&cioa of I-JGA12S SSLP marker in 

Arahidopsis chaliana suhBpecias 

gasaaascgc tactttcgtg g 21 



*2io> 
<211> 

<213> 


&9 
22 

Art i f i c i a 1 s equence 






*220> 

<223> 


Reverse primer for E>Cft amplification of NGA12G 
^rabtdopsis thaliana subspecies 


SSLP 


tnarksr in 


<40Dp 










atcaagagca gc 




22 


<211> 
<2l2> 
*2L3> 


70 
20 
DMA 

Artificial ^eguence. 






<220* 
<223> 


Forward primer for PCn amplification of MGA162 
Arabidopsie tbaLiana subspecies 


5SLP 


marfcer in 




70 








gcatc cgagg 




20 



-c2l0> 
<211> 
*212> 
<213> 


71 
i2 
DNA 

Artificial sequence 




<22D> 
<223> 


Reverse primer for PCR amplification o£ KGA162 
Arabidopeis tbaliana subspecies 


SSLP niarkar in 




71 




ctctgtc&cfc 


CttttcctCt gg 


22 


<210^ 
<211> 
<212> 
<2ll> 


72 
21 
OTA 

Artificial eequenc 





WO 9492 
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x-220> 

■=33 3* Forward primer for PCR arnplif teat ion of HGA5 SSLP marker in 

Arabidopsis thaliana subspecies 

< 400> 72 

tggatttcizt cctctcttca c 21 
<211> 21 

<213> Artificial science 

<223> Hovarse primer for PCR amplification of NGAb SELF marker in 

ArabidopBis thelisna subspecies 

<40Q> 11 

a tgejagasac t.tiacactga L %~ o 1 

«=212> DMA 

<213> Artificial sequence 

■=223=- Forward primer lor PCR amplification of NGA12 SSLP marker in 

Arabidopeis tha liana BubepecieB. 

<<QQ> 74 

aatgttgtcc tcccctcctc 2f) 



<2\0> 75 

<211> 22 

<212> dkA 

-c2I3> Artificial &*qa*hce 

<223> Re verse priroex for PCR Amplification of SSLP marker in 

Axabid psis th&liana subspecies 

rz4bD > 75 



Cgatgctctc tgaascaaga <jc 



32 



WO 99/19492 



J'CT/K J 9&''069'?7 
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<210> 


76 






<2l 1 > 


21 








DMA 






<213> 


Artificial sequence 














<223> 


Forward primar for pcr amplification 


of 


NGA8 SSLP marker in 




76 






gagggcasat 


Ctttatttcg g 




21 




77 






<211> 


22 






<212> 


DUh 




• 


^2l3> 


Artificial sequence 






<22D> 








<225> 


Reverse prirrver fr^r PCR amplification 
Ar aB i oops is c r. ■ ,\ i ;ci sub &p*c 1 6 & 


of 


NGA6 Ssip marker in 


<40D> 


77 






tggctttegt 


ttataaacat cc 




22 


<2i0> 










21 






<212> 


DMA 






<2l3> 


Artificial sequence 














<223> 


Forward primer for PCR finnpli f icac ion Ot 
in ArabidopeiB ttialiana. subspecies 


NOAH 07 marker 


<400> 








gegaaiaasac 


aaaaaaafccc a 




21 




13 






<211> 


21 t 






<212n 


DNA 






<213> 


Artificial sequence 






O20> 








<*223> 


Raver ee primer for PCR amplification 
in Arabidopeis thai i ana subspecies 


Of 


NGA1107 SSLP marker 



<40D> 



79 









PCTVEP9W06577 




45 










^.Cagiaat tag g 






2l 


•:210> 


91? 










J 1 








-:212> 










*213> 


Art i £ i cial a ecjuenca 


















<223> 


Forward primer for PCR acnpLif icat ion 
ArabitfppsiG Cha liana, subspecies 


Qt 


HGA22S 


£5 LP mirkar in 


<4D0> 


ao 










C Cccagsgag g 






21 


<2l0> 


Bi 








+r 4t J. > 










<212> 


SNA 








<213> 


Artificial sequence 


















<222> 


Reverse primer for PCU empli f icst ion 
Arsbidopeie t; ha Liana subspecies 


of 


W3A22 5 


S&Lp marker id 


<400> 










cct cccc act 


agttttgtgt cc 








<210> 


B2 


















<2l2> 


DMA 








<213> 


Artificial sequence 








•cZ20> 










<223> 


Forward primer for I>CR amplification 
Arabidopsis thaliana subspecies 


of 


NO A3 4 9 


SELF marker in 


<400> 












ttca.tcgcc 






l & 


<210> 


63 




















DNA 








<213> 


Artificial sequence 

V 








<320> 










<223> 


fteverza primer for PGR amplification of 
Arabid&psis thaliana subspecies 




5SLP marker in 



PCT/EPWG69T7 



49 

ggatcccsaa ctgtaaaatc ci- 



22 



<210* 34 

--212* DNA 

<2i3> Artificial sequence 

<220> 

<323> For varcS p rimes r for pcr amplification of Chi 2 SSLP marker in 

Arabidopsis tbaliana Subspecies 

<4O0* B4 

aatcccagte accaaacaca ca 25 

<:2L£>> 95 
<2L1;> 20 
<2l2> £na 

<220* 

<223* Reverse priTnex for PCR ampli f icanion of CI\12 SS^p marker in 

Arabi clops is cha liana subspecies 

<400> fl5 

cccagtcta^ ccacgaccac 20 

c21l> 20 
<212> DNA 

Artificial sequence 

<220> 

<25 ** Forv/ard primer for Ptt amplification of NGA1S1 SSLP marker in 

Arabidopais thsliatlfc fcubapecieB 

<400> 86 

gtttcgggaa gttttgctgg , 2Q 

<21D> B7 

<;21i> 24 

<s212> DNA 

<Jli> Artificial sequence 



<22Q* 



W099/1M91 



47 



^-223 > 


r'i'orsQ piiuier tot PCR amplification of 
ArafcitiOpBis t ha liana BubapEcies 


M0A151 


SSLP 


marker in 


• ^ '. J V - 










cagtctaaaa 


SC^Agagtat gatg 








<210> 


Ba 








<2ll> 


32 










DNA 








<213> 


Artificial sequence 








<220> 












ror^ara primer for pcr. amplification of 
Arabidopaia thai i ana sub-spacies" 


NQA106 


SSLP 


marker in 












gt tatggagt 


ttcragggcs eg 






27 


<21{^ 






















POT* 








<213> 


Arc! f icial sequence 








<220 > 












Reverse primer for PCR amplif ica&ion of 
Arabidopais t ha liana subspecies 


NGA106 


SSLP 


markar in 












tgccccatcn 


tgttcttctc 






20 



<210> SD 
<212> DNA 

<213> Artificial sequence 

<;22D> 

<223> Forward primar for PCS amplification *£ NGA13 9 SSL? merger in 

ArabidDpaiB Lhaliana subapaciea 

agagccacca gatccgatgg 2 0 

v 

<210> 32 
-c215s dna 

«c2i3* Artificial sequence 



W0 99A9491 



PCIWP9S.'fl6»77 
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-:22Q > 








< 2 2 3 > 


Itewrs* primer fcr PC5L ampl if 1 cat ion 


of 


9 sslp marker in 




Ar;lbidopaia thai l ana. aufcsp^cl'ieB 






4 OD > 








ggtttcgttt 


cactatocag g 




21 


<21Q> 


92 






<211> 


22 








DNA 






<21?> 


Artificial sequence 






-:220> 








*223> 


Forward primer for PCR amplification 


of 


NGA7 6 SSLP marker in 



Arabi<3opeis tha liana subspecies 



*40(>> 


32 






tcaccctcc* cc 


22 


*2l0:- 

<212;> 
<213> 


20 
DNA 

Arti f icifil sequence 




<220> 
<223> 


R*v*rse priTi>cr for pcr amplification of JTGA7G SSLP inarker in 
Arab i daps ifl t ha liana Stib£pe.ciea 


<4Q0> 


93 




aggcat^gga gacatttacg 


20 


<21C> 
<211> 
<r212> 
<213> 


94 
20 

Artificial sequence 




<22Q> 
«223> 


Forward primer for PCR acnplif teat ion of ATHS0131 
in ArAbidopais tbaliana Subspecies 


bslp marker 


<40Q> 


94 





ctccaccaat catgoaaatg 



20 



49 



PCTVEF98/W977 



<210> $5 

<--2l2> DNA 

Artificial sequence 

c220> 

<223> Reverse primer for PCR amplification of SSLP marker 

in Arabiriopais thaliana subspecies 

>;4Q0> 95 

fcgatgttgat ggagatggtc a ^« 



<210:> 


96 




22 


<212> 


DMA 


<213> 


Artificial sequence* 


*22D> 




<22*> 


Forward primer tor PCR amplification 




ArabidOp^is- th-rt liana aubspecie-S 


<4O0> 




teag-gaggaa 


cfcaaa-gtgaCf <jg 


<210> 


97 


<2ll> 


22 


<212> 


DNA 


<213> 


Artificial sequence 



22 



<223> Reverse primer for PCR Amplification of nsju2 3 marker in 

Arabidopsis thaliauna auhapecLeD 

*40Q> 97 

cacactgaag atggtettga gg 22 



<210> 




<211> 




<212> 


DNA 




Arabidopsis t ha liana dcotype Coluiribia 


<220> 




<223> 


Genomic Dlih Sequence of ACMSK6 


<400> 


97 



tfcttfctggtt gciaacaats aaggtatacg gtttutgec atcaatataa etatatataa 



WO W194R 
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aagaaAtgaa ayatatatat tgtttt tcca tttateaaac sssanaacaa gacttttttt i2p 

ttacttttta cattggtcaa caaaatacaa gataaacgac arcgsttaat, cattteeeaa 19 u 

tttfcaCGCCX aagtttaaca fcctagaacct tctccatctt cgcaagcaca gcctgattag 24 0 

gaacagcttt aceactcfcca tattcctgaa ctacctgagt cctctcattg atctgtttcg 300 

ccaaatcc^c ttgtgacatc ttcttctcca atctcgctct ecgtatcatc aacctcacct 3 ; 5o 

ctgctttcac acgatccatc gccgciggct ctgtucttc ctccagcttc ttcgfcgttaa 4£0 

rcaccggaac cgccgtagst; ttcccctttt tgttcgaacc ggcatcgaat ttcfctaaccg 4 8Q 

tttgaaecgc gacaccgttt ctcagagctg cgttaaccge t^ccggatcg cgtaggfcctt $4 0 

ggctcttttg ctttgatctg tgg.agaacta ctggttccea gtcttgtgtt actgctcctg F00 

ggtatccgct cggcetcgtc gatgaesttga gagsaaggaa caacgcgaaa atfcttattaa 660 

tctgagc£tt gaaattgaga aacgatgaag aegaagaatg tigttgagag gattgtgata 720 
tttatatata cgaagatcgg tttccggaga attcgatcst CZZZlZCtCv attttcgtct 7&0 
Ctggaacgtt cttagagatg attgacgacg tgtcattatc rgatttgcag ttaaccaatg B40 

ctttttgggt tggatCcgig gtacaccata ttatccgatc ^ggctcaatg gcttiCAtata 9 00 
aattcggci:c tcggtteggt tatgagtcat eafctaaaatt sagctaaeca aaaattttcg 9 60 

taaaatttst ttcggtttca actcggatcc cttacttcca gaaccgaatt attcgaaacc 10 20 

ggggctragcc gaaccgaats ccaatgcctg attga^tcgc tgcctagaaa gatccaacgg 10 60 

tatacaataa tagaacataa stcggacggt cafccaaagcc tcaaagagtg eacagtcaac 1140 

aaaaaaagtt gagcccfcgag gagcatcgtt tccgccattt ctacgacgca aggcgaaaat 12 DO 

ttttggcgcc aatCtttCCC CCCtfctfrgaa ttctctcagc tcaaaacatc gtttctctct 1260 

caetetcect cacaattcca aaaaatgcag cgccagagat cgactttgtc tttcctctaa 1320 

aaacccacgg cggcgactac gaagggtttg gtttccggcg atgctgcta^ cggcgggggc 1380 

gg^agcggag accacgattt aatgtga&gg aaggggatgc caaaggcgac gcttdfcgc&e 1440 

gttttgctgt ttcgaaatct gtcgatga§§ ttageggaac ggacactcca ccggagaagg 1SOD 

ttccgcgtcg tgtcctgccg t'fctggattta agccggctg* atccgccggt gatgcttcgb IS 60 
ccctgttctc CAACAttatg cataagtttg taaaagtcga tgatogagat tgttctggag 

agaggtacta afcettcgatt ctcttaattt tgtcatcttt agccggaaga agaagattcg ISBd 



WO 99/1 94& PCnEPWWfiSTT 
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tgtaatttgt 


cgtattcgtt 


ggagagatr.c 


tgattactgc 


aCtggatogt 


tgt&tacaaa 


1740 


ttttcaggag 


ccgagaagat 


gbtghtccgc 


tgaatgafccc 


atctctatgt 


atgaaggcta 


LSOO 


atgatgt Lac 


tcctraattt 


cgttccaaca 


atggtaaaac 


fccaa$aaaga 


aaccatgett 




ttagttcca$ 


tgggagagct 


gaacttagat 


cagtagaaga 


tataggagta 


gatggpcgatg 


192G 


c tcctggcoc: 


agaaacadca 


gggatgcgte 


cacgtgcttc 


tcgcttgaag 


cgagttctgg 


1990 


aggatgaaat 


gacttfctaag 


gaggata&gg 


ttcctgtatt 


ggactctaac 


aaaaggctga 




aaatgctcca 


ggatccggtt 


tgcggagaga 


agaaagaagt 


aaacgaagga 


accaaatttg 


2100 


aatggcttga 


gtcttctcga 


atcagggatg 


ccaatagaag 


Acgtcctgat 


gatceccttt 


2160 


acgatagaaa 


gaccttacac 


atsccacctg 


atgttttcaa 


gaaaatgtct 


gsaXCacaaa 


2220 


egcaatatcg 


gft^egtcaag 


agtgaata&a 


tggacattgt 


gcttttcttt 


aaagtggtta 


22SO 


gtaactatira 


atctagtgtt 


caat ccatct: 


cctcaatgcg 


atttge tcae 


t tacatctgt 


2340 




Ctcttctcat; 


gggaaieiLtt t 


atgagctgta 


tgagctagat 


gcggaattag 


2400 


gtcacaagga 


gcttgactgg 


aagacgacca 


tgagtggtgt 


gggaa&atgc 


ftgacatggtaa 




actagttgaa 


acAactggcC 


egsntgaatt 


attgtgtcta 


taaattttga 


caeca ccttt 


5520 


tgtt tcaggt 


tggtatctct 


gaaagtggga 


tagatgaggc 


agtgeaaaag 


ctatttAgctc 


25BD 


gtgggtaagg 


ga^ccatcat 


actttatgga 


attcgtttac 


tgctacttcg 


gctaggatct 


2G40 


aagaaatgga 


aatcscttca 


agcatcatta 


gtxaggatcc 


tgagaactca 


ggatgttttc 


2700 


ttattcgtta 


tataataagt 


cttttcatca 


aggagtaaca 


aacaaaactt 


gcacaacatfc 


2760 


tgtgegc&ea 


ctggcaaggc 


atatatacce 


agctaaccct 


tgctagttca 


ctgtagcaac 


2920 


agttacggat 


aaeatatgtt 


tacttgtatg 


tggtaccrtc 


attttgtctc 


tcatggaggc 


:eeo 


tttcaagcct 


tgtgetgaaa 


ctggacagtt 


acatatgctt 


ccaacagaaa 


etagcatgea 


2*40 


gattcatatg 


ctttcctatt 


ctactaatta 


tgtattgaca 


cactcgttgt 


ttcttttgaa 


3D0Q 


agatataaag 


ttggacgaat 


cgagcagcta 


Cjaaacatctg 


accaagcaaa 


agecagaggt 


306Q 


gctaatact§ 


taagttttet 


tggataggtc 


aaggagagtg 


ttgcagactg 


tttttgatca 


2120 


tttctttttc 


tgtacattac 


frttc&egctg 


taattaactc 


aa,tggctatt 


ctggtctgat 


3180 


tatcagataa 


CCCcaaggaa 


gctagttcag 


gtattaactc 


c&cca&cagc 


aagcgaggga 


3240 


aacategggc 


ccgacgccgt 


ccatcttctt 


gctataaaag 


ag^tttgtta 


tttacttatt 


3300 
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tatcT^catca tgttcagttc atccaagtcc tg&aaaatta cactcttctt taccaatctt S360 

Ctatcaagct gfcgtaaagga tttggaatta gaaaateatE atttgatgct, tfcgttttAbA 3420 

tgca&gaggt tcccttgaaa agacctgttt aagattcttt gcaceegafca aattcaatct 3430 

ttttaagtga etcccctact ttcttacaat gaecatagtc tgceattgca tgtcaagtaa 3&40 

tatcattcct tgttactgca tccccctctt tcttaatgac cattgtctat gttgtgtttg 35.0 0 

tctcgtgtgc! tggagaaaat gatagctgat ccaagctgtfc cattatcatg attaagtagc 3 6 60 

tgcteaggaa ttgcctttgg ttacatdgcc taatggtttg atgtcaattt ttcbtctgsa 3 720 

dcttcacttt agatcaaa&t ggagctacA* aagtgttcaa ctgegtatgg atttgctttt 3 7B0 
gttgactgtg ctgccttgag gttttgggtt gggtccatca gcgatgatgc atcatgtgct 

gctcttggag cgttattgat gcaggtaagc aagt^tatdc tgtatcttAt: gtgtaccatg 3500 

tga-Cttcctg tgcat^tart tgggtr^cag gaactaattc tgaatcscca tttggtatgt 3960 

tttttreagg tttctccaaa ggaagtgtta tatgacagta aaegt&aact ^Cfctgtatcg 4020 

ccagttgtct tgtdaaacag aatstaaggc aaatgacact ggttaattts aagtgcatac 4033 

atgttgaaat attgcagggc tatcaagaga agcacaaaag gctetaagga aacatacgct 414 0 

gacaggtacc atttcagtag gcaagctaac tgae&attta accgctcacc gaatgatagg 42 00 

tctettAAac Attgdtaatg tagatgatgt ttatgtttca atctaatagg gtctacggcg 42 60 

gtacagccgg ctccagtacc acaagtaatg ^gggatacag atgctgctgg agttagaaac 4*20 

ataatayaat ctaacggata ctttaaaggt tcttetga&t catggaactg tgctgttgst 43 BO 

ggtctaaatg aatgtgatgt tgcccttagt gctcttggag agctaattaa tcaectxftct 4440 

aggetA&agg bgtgttggct tgcttagttt ttgcttttca caaattiagc aaaiggaactfc 4 5 00 

ttcataacfct acagtttcta tctacttgca gctagaagat gtacttaagc Atggggatat 4 550 

ttttccatac caagtttaca ggggttgtct cagaattgat ggccagaega tggtaaatct 4 65 0 

tgagatattt aacaatagct gtgatggtgg tccttcaggc aagcgcatat ttcttttteg *6 6 0 

ataacctcaa ctagagggca gacatagaag gaaaaattcfc aatacttcgt acggatctcc -a 74 0 

agtaagtaat agccgatttt tgtttaqcta fcgtagggacc ttgtacaaat aecttgataa 4B0 0 

ctgtgttagt ccaactggta agcgactett aaggaaccgg atctgccatc cactxa&aga 4B6Q 

tgtagaaagc atcaataaae ggcttgatgt agttgaagaa ttcacggcaa actcag&aag 4*2 Q 



wo 99tmn 

fcatgcaaatc actg^ccagt atctccacaa 
catcaagtct agcgttcgat cat cag cote 
gctgaaacaa cgagtaagta tcaa tea cast 
ataggaefcaa aacatfcaegg gtctagctaa 
ttattcatta catttctctfc aacttattgc 
gaaagggttc agaagtggaa ttgatctgtt 
gagtttgett tataaactct gtaaacttce 
adtt-CtttCt caattcgaag cagecataga 
ctatctteca tacttcaska eaaa&tgtct 
gatctcaact cacaccccga g^atcctaaa 
gatggtttga tttatttcta acttgaactt 
sgacgaaaac gccgaaactc tcaeaatact 
atggcctgag gtDattcaca rcataagctg 
agcaagtctc tctgctggaa gcatggccag 
agatcagaae Cagaaaacaa aagggccaat 
tgcagttgca gccgatggte aattgcctgt 
aagaagdagt ggcagcattg atCCCcggtC 
aa*ateaaefc fcttcttcgtg caacatgtct 
cgttagataa ttactctatt efcttgeaate 
ttctgtctgc agcttggccg cfcacgtgccg 
ac&atcttca caaggcttgg cgcatctgafc 
tefceaaaata ccaattccfcc gaactattta 
ttt^cttttt tttaggtacc tttttggtag 
atgeaaetea SSattcacta gtaatccttg 
atggatacgc cattgeatae tc^gnAacct 
caacaaaaac atgeaattea ttttgctgaa 
cctggtagag aaagfctcaat gteggatget 
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acttccagac 


ttagaaagac 


Cgctcggscg 


43.5a 


tgtgttgcct 


gctctcctgg 


ggaaa-saagc 


5040 


gttttctgag 


taat-gectte 


catgagtagt 


■ 510* 


agactgttct 


ccttcittttg 


caatgtctgg 


51*l> 


attgcatjgfct 


aaagcafcttg 


ggcaaatt^t 


525f> 


gttggctcta 


cagaaggaat 


Caaatatgat; 


5£B0 


tAtattagta 


ggaaaaagcg 


ggctagagtt 


5540 


tagegacttt 


ccaaattatc 


eggtgeceat 


5400 


gtc acta etc 


aaagcaatgc 


atatggecta 


54SD 


gggattcget 


tnttattcct 


aatgtttttg 


5520 


aCtaatcttg 


taccagaacc 


aagat^tgac 


55*0 


tatcgaactc 




gageaactca 




cctagACgtc 


ctgagatctt 


ttgeaatege 


$?D0 


gcctgttatt 


tttccogaat 


cagaagctac 


57S0 


acttaaaatc 


caaggactat 


ggcatccatt 


5020 


tccgaatgat 


ataeceetfcg 


gcgaggctag 


SBB0 


attgttactg 


aegggaccaa 


acatgggegg 


5940 


ggocgttatc 


tttgcccaag 


tttgtatact 


bODO 


agttcctcaa 


catgaataat 


aAAttctgct 


60ff0 


tgtgagtcct 


gegAaatcbc 


ectcgtggat 


S12f> 


a^faatcatga 


caggagagag 


taagttttgt 




ctcagatttt 


gtctgattgg 


acaaggtggt 




aatgcactga 


gaeagegtea 


gttcttcaga 




acgaactggg 


cagaggaact 


agtactttcg 


S36Q 


gctcttctcc 


ttcaacttat 


aCdtgttgat 




acttattgat 


ttatateagg 


tttttogtca 


G4BO 


etttgeaaca 


cattaccRCc 


ctctcac<?aa 


6540 



WO 99/1 949 1 



PCT/EP9S/W5977 



54 

ggaaetcgcg tmcacccsc gtgtcacctc gaaacacatg gcttgcgcat tcaaatcaag 

atctgattat caaccacgtg gttgtgatca agaectagtg ttcttgtacc gtttaaccga 666 0 

gggacjcttgt cctgagagct acggacttca agtggc&cte acggctggaa Caccaaacca S^SD 

agtggcegaa acagcateag gtgcfcgcfcca agccatgaag agatcaattg gggaaaacte S7B0 
caagtcaagt gagccaagat ctgagttctc aagtctgcat gaagactggc tcaagtcatt 

ggtgggcatt tctogagtcg CCCacaacaa tgeccccatt ggcgaagatg actacgacac 6900 

tttgttttgc ttatggcatg agatcaaatc ctcttactgt $ttce:eaaafc aAatggqtat 5960 

ga.cataacac tatctgaagc tcgttaagtc ttttgcttct Ctgatgttfca ttcctcttaa 702 0 

aaaatgcfcta tatatcaa&a aategtttec tcgattataa caagattata tatgtafcctg 70 8 0 

tcggtttagc tatggtatat aatatatgta fcgctcaLgag attggtcaag agaaatacto 714 0 

acaaac&sta tattaagaag gaaaeatgtt tatgcattaa tttaagtttc aagataaact 73 DO 

gcaaataacc tcgactaaag ttgcaaagac caaacaeaaa ttacaaaact tataagactt 726Q 

aagttctgaa ttccctaaaa ccA&aaaaaa aaacagaaca tattttgttg catctacaaa 73 20 

caacacaaac ctacatagtt tataacttac tcatcactga ^attaacafce agaiaecaifcee 75513 

tccatctctt Catcttcact ctratcatca tcaccaccac catgatgatt ctcctcctct 744& 

tcaggtaacc tagcaatctc actctgagcfc ctatcaacaa tctgcttctt ctgea.actcc 7500 

saatccctct gaaaatcagc tctcatcttc tccaactcct tcaettgetc cctettaetc 75*0 

ttctccatct tctcataaae etteccaaac ctctcaacag aatccgccaa catcttacac 7G20 

gaagcagcgt cattaacctt ettcctctcg tactcaacct catcatcctc atcctcctcc 76 B0 

tcttcagaat caccaggact ate cat cats tcat&eaaaec cattagactt atctaaataa 7740 

acettagtgt tcataaacac aaactcacct gaateaacac eacaagceaa acctaaatcc 7SO0 

gacttgggcg aaacacaaag caacatatcc aacttattga aaaAcgacea tttacttgaa 7$6& 

ectaaacctg atttctcaac cttaatcttc tcttttctac acttcctctt caagtcatca ^920 

afceantetee tacattgegt ctcagatttc tccatcctta gctectcact eactttctca 7 930 

gctacttcat tccaatcctc ^ttcctcaaa ctccttctac ecaattgeaa aaacctatct B04 0 

CCCCaaactt Caagcaacac aa 8 0 62 
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